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Tics cs ssssssssssenssesesstateesnanmesesennereaamesse ego JiAlg eu Jai 
Face Mask Detection with Machine (JDi ici &o angl eië éluriAl (2 
PR RS dd Learning 
a à à angl elié éUbiAl (8 éonëo 
Le SR QD plc plait angll ei Sli Al alor 
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Chatbot with Machine Learning gb JU mic &o JD 210 (5 

Bis name ent ne en Me ets Len tn nee fr and Python 
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Pneumonia Detection with gÙl &o 59jyl UlaiUi ous (1 
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O9 plain 59 I lai SU JD roll eg J1ino 
2 plaseul (65 DIN GEST Je AI LS pose O2 55 ge dll Ha 3 
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from fastai.vision import * 


O9 &o Sgi HI ail au 


from fastai.metrics import 
error _ rate 

import os 

import pandas as pd 


import numpy as np 


apall SSI de peus ès te Lo pall US de goes lue olie) LI ON Etes 


D 

x = ‘Path' 
path = Path(x) 
path.ls( 
GAN DIN GEST Loge) LI el 55e 3 coul SUR ass ON LE 

0 tb pleut 
np.random.seed(48) 
data = ImageDataBunch.from folder(path, train = ‘.', valid _pct=0.2, 


ds_tfms=get_ transforms(), size=224, 


num_workers=4).normalize(imagenet_stats) 
DL éUA 


JS see ecrit cales G.jpg las La Lyonsenul 21 OULQI eo some 0055 pe 
Msoll Goal LL mul le Age 
os pal Je ImageDataBunch.from folder() 41 phil Ji rte 
cé Leïel,3 ci GI Mol ul Le Lu el Le 

data.show batch(rows=3, figsize=(7,6),recompute_scale factor=True) 


PNEUMONIA NORMAL PNEUMONIA 
= 


PNEUMONIA PNEUMONIA 
ee .. 


PNEUMONIA NORMAL 
g 


U9 DU &Ë) polaà ul 59f I al cc ii AU QU role il piaàüul 
RAI Ge g5 325 «ResNet50 pub 05,4 Gone ao6 à5 Se paisule «Ni 
gs nl la pleut LES (s 5 Geo NI NI Lil GONN Li exil 


learn =  cnn learner(data, models.resnet50, metrics=[accuracy], 
model_dir = Path('Path'),path = Path(".")) 


5 learning rate xl Jise le 5,15 Lab OI Les 


learn.lr find() 


learn.recorder.plot(suggestions=True) 


1e-06 1e-05 1e-04 1e-03 1e-02 le-01 1e+00 
Leaming Rate 


eigoil Juislg Lu yai 
SU Le sene Je pisll cup pile ON 5 sel (Lane Las cool ul 
1 Lo 


1ri = 1e-3 
1r2 = 1e-1 
learn.fit_one cycle(4,slice(lr1,lr2)) 


# lr1i = 1e-3 
1r = 1e-1 
learn.fit_one cycle(28,slice(lr)) 


learn.unfreeze() 
learn.lr find() 
learn.recorder.plot() 


learn.fit_one cycle(18,slice(1e-4,1e-3)) 


anis oi a 


learn.recorder.plot _losses() 


gi pil 5 Les 


interp = ClassificationiInterpretation.from learner(learn) 


interp.plot_ confusion matrix() 
img = open_image('IM-0001-0001.jpeg') 


print(learn.predict(img)[@]) 
NORMAL 


DS of Jef EN DEN ge CAES NI Let pleut Les Ses a 4èe Le Léa ok 
0 fl plat 665 DIN GUEST Je CNT ed 5 ve 32 Al ad of 15 
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Face Mask Detection with (Qi miciül &o angl lié sui (2 
Machine Learning 
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SN pb plat al US GUEST Joe Losulell LU paie EU pale au 


angl ei éUiiiAl(,8 &onëo 
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Le 55 2019 ca CLAS | Ju Lgfleo JR (6 grues Lilanall ele ed! 0355 Le LE 
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semantic AJYAI Sim se Lan beie MSI (og le Lol le Juaxil ol 
7e) US JS Lans us des À NI 8,2) segmentation 
QI I UI pol . LU angl lié tél  … | .. ul 

oo ar SL pl ul :1 8 sl 

els Do ban sf ELS Gesell ès Je Lènaall Cuyi5 12 5 slasull 

SSD «> Ji Li plait SULAI S eLf 0 52 411 GUEST :3 5 sas 

RERO] os CAD ce pholl | phil 145 sla>)| 
us FSSD ax gl CS ça Le Sal le cool 3 Sol Aa ve JU 5 al à 
Single Shot Multibox 5413) al 55 pslloll six LAëlS ee 5e SSD 
8 9 dues en phèeul jy peil de CLESN ps Eur oja .Detector 
D Res VGG-16 43 Lulul LS plu 3, pal Ge LAN je RAS alu sy 
ce Faster R-CNN; YOLO fe GENS 1e Ca ser de SSD Go 
BAM; de | Eee 


dre aol eus as 


QI roll &o angl elië élUiiAl 
094 Le y A past NI sl plais ax sl p US CLAST Loges LAS Les ol 


ogall ohg) lerlons al dns 2 D Ab Re sul Gb ce ogoll os Lite 


CDÜUUI àcgon0 JLoni 


import pandas as pd 
import numpy as np 
import cv2 
import json 
import os 
import matplotlib.pyplot as plt 
import random 
import seaborn as sns 
from keras.models import Sequential 
from keras import optimizers 
from keras import backend as K 
from keras.layers import Dense, Dropout, Activation, Flatten 
from keras.layers import Conv2D, MaxPooling2D, BatchNormalization 
from sklearn.model selection import train test split 
from keras.preprocessing.image import ImageDataGenerator 
directory = "../input/face-mask-detection-dataset/Medical mask/Medical 
mask/Medical Mask/annotations" 
image directory = "../input/face-mask-detection-dataset/Medical 
mask/Medical mask/Medical Mask/images" 
df = pd.read csv("../input/face-mask-detection-dataset/train.csv") 
df_test = pd.read_ csv("../input/face-mask-detection- 
dataset/submission.csv") 
acluoJi, Jig2 ini 
osdelus cils eL4SL Legal aa fil 


cvNet = cv2.dnn.readNetFromCaffe('weights.caffemodel") 
def getJSON(filePathandName): 

with open(filePathandName, 'r') as F: 

return json.load(f) 

def adjust_gamma(image, gamma=1.0): 

invGamma = 1.0 / gamma 

table = np.array([((i / 255.0) ** invGamma) * 255 for i in 
np.arange(@, 256)]) 


= 
O1 
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return cv2.LUT(image.astype(np.uint8), table.astype(np.uint8)) 

çà dll Gotall SEL Le y ill json ile getJSON dl 5,5 .1 
en ff le de ses 

mi es ébs je, pes kb ,é is Adjust gamma dl ds .2 
ee bles AU , all sl pal LB & tristimulus 31 luminance 
Jui «gamma <1 ESS 5) 5 +52 de Je asie 
Ita 5 cgamma> 1 0 Les CBI se Geël © LI I 3, sl 
Bye à epall ce dj 


ÜUUI &a)lco 
1 EN Lil JSON SUL GLS a GNT LIEU 5 ll 


Jsonfiles=[] 
for i in os.listdir(directory): 


jsonfiles.append(getJSON(os.path.join(directory,i))) 
jsonfiles[e] 


{'FileName': '2349.png', 

"NumOfAnno': 4, 

‘Annotations': [{'isProtected': False, 
‘D: 193452793312540288, 
‘BoundingBox': [29, 69, 285, 343], 
‘classname": ‘face_other_covering', 
"Confidence': 1, 

'Attributes': {}}, 

{'isProtected': False, 

D": 545570408121800384, 
‘BoundingBox': [303, 99, 497, 341], 
‘classname": ‘face_other_covering', 
"Confidence': 1, 

'Attributes': {}}, 

{'isProtected': False, 

D": 339053397051370048, 

‘BoundingBox': [8, 71, 287, 373], 
‘classname": ‘hijab_niqab', 
"Confidence': 1, 

'Attributes': {}}, 

{'isProtected': False, 

"ID": 100482004994698944, 
‘BoundingBox': [296, 99, 525,371], 
‘classname": ‘hijab_niqab', 
"Confidence': 1, 

'Attributes'": {}}]} 


QD lc il &o angl elié ut 


sl er SL Cle Annotations field is sl ce Le ss + 
Lure 5) e 85 57 ol 
5 face with mask a LaaJl el Lu QU us els Lies 


face no mask. 


df = pd.read_csv("train.csv") 


df.head() 
name x1 x2 y1 y2 classname 
0 2756.png 69 126 294 392 face_with_mask 
1 2756.png 505 10 723 283 face_with_mask 
2 2756.png 75 252 264 390 mask_colorful 
3 2756.png 521 136 711 2e mask_colorful 
4 6098.jpg 360 85 728 653 face no _ mask 


SU) Lonell Goal SEL Flu y non mask eLell, Lil lieu 
il Lux) ee pe LL EE des s done 532 0579 cl = -json 


data = [] 
img_size = 124 
mask = ['face with _mask'] 
non_mask = ["face no mask] 
labels={"mask':0,'without mask':1} 
for i in df["name"].unique( ): 
f = i+".json" 
for j in getJSON(os.path.join(directory,f)).get("Annotations"): 
if j["classname"] in mask: 


X,Y,W,h = j["BoundingBox"] 


img = cv2.imread(os.path.join(image directory,i),1) 


img = img[y:h,x:w] 
img = cv2.resize(img,(img_ size,img_size)) 
data.append([img,labels["mask"]]) 

if j["classname"] in non mask: 
X,Y,W,h = j["BoundingBox"] 


img = cv2.imread(os.path.join(image directory,i),1) 
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img = img[y:h,x:w] 

img = cv2.resize(img,(img_size,img_size)) 

data.append([img,labels["without mask"]]) 
random.shuffle(data) 


ps] 
for face in data: 
if(face[1] == @): 
p.append("Mask") 
else: 
p.-append("No Mask") 
sns.countplot(p) 


4000 
3500 
3000 
= 2500 
[+ 
2 
8 2000 


Mask No Mask 


re EL Àc 97 og (0 5 D9k y pail > <a 5 Le ol oeil cn Es 
né og GLS de Dis Be Lt SSD pe paies LON (5 LS 25e 
LES Ge 0 5) dll AUDE NI ie 0 deal 


ant EI Cat Je blt LRA5 de Gses 5 S 


€ [1] 

ve [] 

for features,label in data: 
X.append(features) 
Y.append(label) 


np.array(X)/255.0 
X.reshape(-1,124,124,3) 


dre aol eus as 


Y = np.array(Y) 

aagl elié élu mac Lu Jai à ui 
pl plièsol al pl CLS Loge le Luec 2 nt La LIU 5,301 DVI 

er 

model = Sequential() 
model.add(Conv2D(32, (3, 3), padding = "same", activation='relu, 
input_shape=(124,124,3))) 
model.add(Conv2D(64, (3, 3), activation='relu')) 
model.add(Conv2D(128, (3, 3), activation='relu')) 
model.add(MaxPooling2D(pool_ size=(2,2))) 
model.add(Dropout(@.25)) 


model.add(Flatten()) 
model.add(Dropout(@.5)) 
model.add(Dense(50, activation='relu')) 
model.add(Dropout(@.5)) 


model.add(Dense(1, activation="sigmoid')) 


model.compile(loss="binary crossentropy', optimizer="adam" 
,metrics=['"accuracy']) 
xtrain,xval,ytrain,yval=train test _split(X, 
Y,train_size=-0.8, random _ state=0) 

featurewise center=False, 

samplewise center-False, 

featurewise_std_ normalization-False, 

samplewise std normalization-False, 

zca_ whitening=-False, 

rotation range=15, 

width_shift_range-0.1, 

height _shift_range-0.1, 

horizontal flip=True, 

vertical flip-False) 


datagen.fit(xtrain) 


history = model.fit generator(datagen.flow(xtrain, ytraïin, 
batch_size=32), 
steps_per_epoch=xtrain.shape[@]//32, 
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epochs=50, 
verbose=1, 


validation data=(xval, yval)) 


&23goil juil 
ge D ne DEN I 5 a EN 5 55 pe 1698 (le JUN GLS Le sage (5 5 
OLLYI de sages face tags as de 5525 paix 5 ele LUI Le jague ce y pol 


test_images = ['1114.png','1504.jpg", 
"0072.)jpg",'0012.)jpg','0353.jpg','1374.jpg"'] 
gamma = 2.0 

fig = plt.figure(figsize = (14,14)) 


rows = 3 
cols = 2 
axes = [] 


assign = {'0':'"Mask','1':"No Mask"} 
for j,im in enumerate(test_images): 
image = cv2.imread(os.path.join(image directory,im),1) 
image = adjust gamma(image, gamma=gamma) 
(h, w) = image.shapel:2] 
blob = cv2.dnn.blobFromImage(cv2.resize(image, (300,300)), 1.0, 
(300, 308), (104.0, 177.0, 123.@)) 
cvNet.setInput(blob) 
detections = cvNet.forward() 
for i in range(@, detections.shape[2]): 
CPY: 
box = detections[@, @, i, 3:7] * np.array([w, h, w, h]) 
(startX, startY, endX, endY) = box.astype("int") 
frame = image[startY:endY, startX:endX] 
confidence = detections[@, @, i, 2] 
if confidence > 0.2: 
im = cv2.resize(frame,(img_size,img_size)) 
im = np.array(im)/255.0 
im = im.reshape(1,124,124,3) 
result = model.predict(im) 
if result>0.5: 
label _ Y = 1 


else: 


dre aol eus as 


label _Y = 0 


cv2.rectangle(image, (startX, startY), (endX, endY), (0, 
@, 255), 2) 

cv2.putText(image,assign[str(label Y)] , (startX, 
startY-10), cv2.FONT HERSHEY SIMPLEX, 1.5, (36,255,12), 2) 


except:pass 
axes.append(fig.add_ subplot(rows, cols, j+1)) 
plt.imshow(cv2.cvtColor(image, cv2.COLOR BGR2RGB)) 
plt.show() 


el ee dr JS Jess St la of LA of LS cedof AU Hess Je 5e 
57 les 05 AN pell DL GLS LES .spatial dominance &ls à Le Le ol 
Lez M 5, pell JB lus JRës 5 5,4 

ges 3À 0) pal Ball Laleul) Lise DS pleut Le cit ail Le mao 
at blob pla 25 6 je Ses sf ès LAN de ei] 
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Earthquake Prediction QUI ricil &o JjU JU Gui 83905 (3 
Model with Machine Learning 

NI ll plie JV 055 Loges p5se el LES Le Eulble ülädl sie à 
Dee lle pie el A DAS AO ST at JS set des 0 sigle due y As 
LS SUN AIN LS lame pe ddl ol; Less plécul 545 pe 
Jp Re SUN le dans AU ce NI LAS NI bail plu 


Url &o JjU JU EU e9.0) 
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import numpy as np 

import pandas as pd 

import matplotlib.pyplot as plt 
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data = pd.read_ csv("database.csv") 


data.columns 


Index(['Date', "Time, 'Latitude', Longitude’, 'Type', 'Depth', 'Depth Error, 
"Depth Seismic Stations’, Magnitude', 'Magnitude Type", 
‘Magnitude Error’, 'Magnitude Seismic Stations’, 'Azimuthal Gap’, 
"Horizontal Distance’, "Horizontal Error", "Root Mean Square', ID", 
‘Source’, Location Source’, Magnitude Source", 'Status'], 
dtype='object") 
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«æl; depth Gall; longitude J ll Ls-, latitude ,,2,«1 Lé-; time 25 JJl;date 
‘magnitude 


data = data [['Date':'Time' «'Latitude' ‘'Longitude' ‘«'Depth'« 
"Magnitude']] 
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data.head() 


date Time Latitude Longitude Depth Magnitude 
0 01/02/1965 13:44:18 19.246 145.616 131.6 6.0 
1 01/04/1965 11:29:49 1.863 127.352 80.0 5.8 
2 01/05/1965 18:05:58 -20.579 -173.972 20.0 6.2 
3 01/08/1965 18:49:43 -59.076 -23.557 15.0 5.8 
4 01/09/1965 13:32:50 11.938 126.427 15.0 5.8 
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import datetime 


import time 


timestamp = [] 
for d, t in zip(data['Date'], data['Time']): 
try: 
ts = datetime.datetime.strptime(d+' ‘+t, ‘"%m/%d/%Y %H:%M:%S') 
timestamp.append(time.mktime(ts.timetuple())) 
except ValueError: 
# print('ValueError') 
timestamp.append("ValueError') 
timeStamp = pd.Series(timestamp) 
data['"Timestamp'] = timeStamp.values 
final_data 


data.drop(['Date', ‘"Time'], axis=1) 
final_data = final _ data[final_ data.Timestamp != 'ValueError'] 
final _data.head() 


Latitude Longitude Depth Magnitude Timestamp 
0 19.246 145.616 131.6 6.0 -1.57631e+08 
1 1.863 127.352 80.0 5.8 -1.57466e+08 
2 -20.579 -173.972 20.0 6.2 -1.57356e+08 
3 -59.076 -23.557 15.0 5.8 -1.57094e+08 


4 11.938 126.427 15.0 5.8 -1.57026e+08 
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from mpl_toolkits.basemap import Basemap 


m = Basemap(projection='mill',llcrnrlat--80 ,urcrnrlat=80, llcrnrlon=- 


180 ,urcrnrlon=180, lat _ts=20,resolution="'c') 


longitudes = data["Longitude"].tolist() 

latitudes = data["Latitude"].tolist() 

#m = Basemap(width=12000000 ,height=-9000000 , projection="'lcc, 
#resolution=None,lat _1=80.,l1at 2=-55,lat 0=-80,1lon @=-107.) 

m(longitudes, latitudes) 
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plt.figure(figsize=(12,10)) 

plt.title("All affected areas") 

m.plot(x, y, "o", markersize = 2, color = ‘blue') 
m.drawcoastlines() 
m.fillcontinents(color='coral',lake color='aqua") 
m.drawmapboundary() 

m.drawcountries() 


plt.show() 
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from sklearn.cross validation import train test split 


final _ data[['Timestamp', ‘'Latitude', 'Longitude']] 
final _ data[['Magnitude', ‘Depth']] 


X train, X_ test, y_ train, y_ test = train test_split(X, y, 
test_size=0.2, random state=42) 


print(X train.shape, X test.shape, y train.shape, X_test.shape) 
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from keras.models import Sequential 


from keras.layers import Dense 


def create model(neurons, activation, optimizer, loss): 
model = Sequential() 
model.add(Dense(neurons, activation=activation, input _shape=(3,))) 
model.add(Dense(neurons, activation=activation)) 
model.add(Dense(2, activation='softmax')) 


model.compile(optimizer=optimizer, loss=loss, metrics=['accuracy']) 
ail de pal ST jf clés hyperparameters lui ll aus OV es 
:best fit Les We 
from keras.wrappers.scikit learn import KerasClassifier 


model = KerasClassifier(build fn=create model, verbose=®@) 


# neurons = [16, 64, 128, 256] 


neurons = [16] 
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# batch size = [10, 20, 50, 100] 

batch_size = [10] 

epochs = [10] 

# activation = ['relu', ‘tanh', ‘sigmoid', ‘hard sigmoid', ‘linear’, 
‘exponential'] 

activation = ['sigmoid', ‘'relu'] 

# optimizer = ['SGD', ‘RMSprop', ‘Adagrad', ‘Adadelta', ‘Adam’, 
‘Adamax', ‘Nadam'] 

optimizer = ['SGD', ‘Adadelta'] 


loss = ['squared_ hinge'] 


param_ grid = dict(neurons=neurons, batch size=batch size, 


epochs=epochs, activation=activation, optimizer=optimizer, loss=loss) 
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grid = GridSearchCV(estimator=model, param _grid=param grid, n_jobs=- 
1) 
grid result = grid.fit(X train, y train) 


print("Best: %f using %s" % (grid result.best score, 

grid result.best_params_)) 

means = grid result.cv results ['mean test_score'] 

stds = grid result.cv results ['std test _score'] 

params = grid result.cv results ['params'] 

for mean, stdev, param in zip(means, stds, params): 
print("%f (%f) with: %r" % (mean, stdev, param)) 


Best: 0.957655 using {'activation': ‘relu', ‘batch size': 10, 
‘epochs': 10, ‘loss': ‘squared hinge', ‘neurons': 16, ‘optimizer': 
"SGD'} 0.333316 (0.471398) with: {'activation: "sigmoid', 
"batch_size': 10, ‘epochs': 18, ‘loss': ‘squared hinge', ‘neurons': 
16, ‘optimizer': ‘SGD'} 0.000008 (0.000000) with: {'activation': 
"sigmoid', ‘batch size‘: 10, ‘epochs': 10, ‘loss': ‘squared hinge', 
"neurons': 16, ‘optimizer': ‘Adadelta'} @.957655 (0.029957) with: 
{'activation': ‘relu', ‘batch size': 10, ‘epochs': 19, ‘loss’: 
"squared_hinge', "neurons': 16, ‘'optimizer': "SGD'}  @.645111 


(6.456969) with: {'activation': ‘relu', ‘batch size': 10, ‘epochs': 
19, ‘loss': ‘squared_hinge', ‘neurons': 16, ‘optimizer': ‘Adadelta'} 
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model = Sequential() 
model.add(Dense(16, activation='relu', input_shape=(3,))) 
model.add(Dense(16, activation='relu')) 


model.add(Dense(2, activation='softmax')) 


model.compile(optimizer="SGD', loss='squared hinge', 
metrics=['accuracy']) 

model.fit(X train, y train, batch size=10, epochs=20@, verbose=1, 
validation data=(X test, y test)) 


[test_loss, test_acc] = model.evaluate(X test, y test) 
print("Evaluation result on Test Data : Loss = {}, accuracy = 
{}".format(test_loss, test_acc)) 


Evaluation result on Test Data : Loss = 0.5038455790406056, accuracy 
= 0.9241777017858995 
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import numpy as np 

import pandas as pd 

import keras 

import cv2 

from matplotlib import pyplot as plt 

import os 

import random 

from PIL import Image 
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samples = 20000 

df = pd.read_ csv("train.csv") 

df = df.loc[:samples,:] 

num_ classes = len(df["landmark_id"].unique()) 
num_ data = len(df) 


oi SEL Gin El D des Lol SUR ae le 5 AG Lies ON 


print("Size of training data:", df.shape) 


print("Number of unique classes:", num classes) 


Size of training data: (20001, 2) 
Number of unique classes: 1020 
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data = pd.DataFrame(df['landmark_id'].value counts()) 

#index the data frame 

data.reset_index(inplace=True) 


data.columns=['"landmark_id','count'] 


print(data.head(108)) 
print(data.tail(10)) 


landmark_id count 


e 1924 944 
1 27 584 
2 454 254 
3 1346 244 
4 1127 201 
5 878 193 
6 2185 177 
7 1101 162 
8 389 148 
9 219 139 
landmark_id count 
1918 499 2 
1611 1942 2 
1012 875 2 
1613 2297 2 
1014 611 2 
1015 1449 2 
1916 1838 2 
1817 684 2 
1018 374 2 
2 


1919 991 
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print(datal['count'].describe())#statistical data for the distribution 
plt.hist(data['count'],100,range = (0,944),1label = ‘'test')#Histogram 
of the distribution 

plt.xlabel("Amount of images") 

plt.ylabel("Occurences") 


count 1020 .000000 


mean 19.608824 
std 41.653684 
min 2.000000 
25% 5.000000 
50% 9.080000 
75% 21.000008 
max 944.000800 


Name: count, dtype: float64 
Text(®, 0.5, ‘Occurences') 


Occurences 


0 200 400 600 800 
Amount of images 


Da) ce All 15e Le A pe cobalt LIU OÙ cel SLI ou (cs 5 LS” 
print("Amount of classes with five and less datapoints:", 
(datal['count'].between(@,5)).sum()) 


print("Amount of classes with with between five and 10 datapoints:", 
(datal['"count'].between(5,18)).sum()) 


n = plt.hist(df["landmark_id"],bins-df["landmark_id"].unique()) 
freq_info = n[e@] 
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plt.xlim(@,datal['landmark_id'].max()) 
plt.ylim(@,datal 'count'].max()) 
plt.xlabel('Landmark ID') 
plt.ylabel('Number of images') 


Amount of classes with five and less datapoints: 322 
Amount of classes with with between five and 18 datapoints: 342 
Text(@, 9.5, ‘Number of images') 
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from sklearn.preprocessing import LabelEncoder 


lencoder = LabelEncoder() 
lencoder.fit(df["landmark_id"]) 


def encode_label(1bl): 


return lencoder.transform(lbl) 


def decode_ label(1bl): 


return lencoder.inverse transform(lbl) 


def get_image_ from _ number(num): 
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fname, label = df.loc[num,:] 


" jpe" 


fname = fname + 
f1 = fname[0] 
f2 = fname[1] 
f3 = fname[2] 
path = os.path.join(#f1,f2,f3,fname) 

im = cv2.imread(os.path.join(base path,path)) 


return im, label 


print("4 sample images from random classes:") 
fig=plt.figure(figsize=(16, 16)) 
for i in range(1,5): 
a = random.choices(os.listdir(base path), k=3) 
folder = base path+'/'+a[0]+'/'+a[1]+'/'+a[2] 
random_img = random.choice(os.listdir(folder)) 
img = np.array(Image.open(folder+'/'+random_img)) 
fig.add subplot(1, 4, i) 
plt.imshow(img) 
plt.axis('off') 


plt.show() 


from keras.applications import VGG19 
from keras.layers import * 


from keras import Sequential 


### Parameters 


# learning rate = 0.0001 

# decay_speed = 1e-6 

# momentum = 0.09 

# loss function = “"sparse categorical crossentropy" 


source _ model = VGG19(weights=None) 
#new_ layer = Dense(num classes, activation=activations.softmax, 


name="prediction") 
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drop_layer = Dropout(@.5) 
drop_layer2 = Dropout(@.5) 


model = Sequential() 

for layer in source model.layers[:-1]: # go through until last layer 
if layer == source model.layers[-25]: 

model.add(BatchNormalization()) 

model.add(layer) 

# if layer == source model.layers[-3]: 

# model.add(drop layer) 

# model.add(drop layer2) 

model.add(Dense(num_ classes, activation="softmax")) 


model .summary() 


opt1i = keras.optimizers.RMSprop(learning_ rate = 0.008001, momentum = 
0.09) 
opt2 = keras.optimizers.Adam(learning_rate=-0.001, beta_1-0.9, 
beta_2=-0.999, epsilon=1e-07) 
model.compile(optimizer=opt1, 

loss="sparse categorical_ crossentropy", 


metrics=["accuracy"]) 


#sgd = SGD(lr=learning rate, decay=decay_ speed, momentum=momentum, 
nesterov=True) 

# rms = keras.optimizers.RMSprop(lr=learning rate, momentum=momentum) 
# model.compile(optimizer=rms, 

# loss=loss function, 

# metrics=["accuracy"]) 

# print("Model compiled! \n") 
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Model: “sequential" 


Layer (type) Output Shape Param # 
batch_normalization (BatchNo (None, 224, 224, 3) 12 
block1_convi (Conv2D) (None, 224, 224, 64) 1792 
block1_conv2 (Conv2D) (None, 224, 224, 64) 36928 
block1_pool (MaxPooling2D) (None, 112, 112, 64) e 
block2_convi (Conv2D) (None, 112, 112, 128) 73856 
block2_conv2 (Conv2D) (None, 112, 112, 128) 147584 
block2_pool (MaxPooling2D) (None, 56, 56, 128) @ 
block3_convi (Conv2D) (None, 56, 56, 256) 295168 
block3_conv2 (Conv2D) (None, 56, 56, 256) 590080 
block3_conv3 (Conv2D) (None, 56, 56, 256) 5900808 
block3_conv4 (Conv2D) (None, 56, 56, 256) 590080 
block3_pool (MaxPooling2D) (None, 28, 28, 256) e 
block4_convi (Conv2D) (None, 28, 28, 512) 1180160 
block4_conv2 (Conv2D) (None, 28, 28, 512) 2359808 
block4_conv3 (Conv2D) (None, 28, 28, 512) 2359808 
block4_conv4 (Conv2D) (None, 28, 28, 512) 2359808 
block4_pool (MaxPooling2D) (None, 14, 14, 512) e 
block5_convi (Conv2D) (None, 14, 14, 512) 2359808 
block5_conv2 (Conv2D) (None, 14, 14, 512) 2359808 
block5_conv3 (Conv2D) (None, 14, 14, 512) 2359808 
block5_conv4 (Conv2D) (None, 14, 14, 512) 2359808 
block5_pool (MaxPooling2D) (None, 7, 7, 512) e 
flatten (Flatten) (None, 25088) e 

fc1 (Dense) (None, 4096) 1902764544 
fc2 (Dense) (None, 4996) 16781312 
dense (Dense) (None, 1920) 4178940 


Total params: 143,749,192 
Trainable params: 143,749,186 


Non-trainable params: 6 
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##H# Function used for processing the data, fitted into a data generator. 


def get_image from number(num, df): 
fname, label = df.iloc[num,:] 
fname = fname + ".jpg" 
f1 = fname[0] 
f2 = fname[1] 
#3 = fname[2] 
path = os.path.join(#f1,f2,f3,fname) 


im = cv2.imread(os.path.join(base path,path)) 


return im, label 


def image reshape(im, target_size): 


return cv2.resize(im, target_size) 


def get_batch(dataframe,start, batch size): 
[] 
[] 


image array 


label array 


end_img = start+batch size 
if end_img > len(dataframe): 


end_img = len(dataframe) 


for idx in range(start, end_img): 


n = idx 


im, label = get image from number(n, dataframe) 


im = image reshape(im, (224, 224)) / 255.0 
image_array.append(im) 


label_array.append(label) 


label_ array = encode label(label array) 


return np.array(image array), np.array(label_ array) 


batch_ size = 16 
epoch_shuffle = True 
weight_classes = True 


epochs = 15 


# Split train data up into 80% and 20% validation 


train, validate = np.split(df.sample(frac=1), [int(.8*len(df))]) 
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print("Training on:", len(train), "samples") 


print("Validation on:", len(validate), "samples") 


for e in range(epochs): 
print("Epoch:. 
if epoch shuffle: 


train = train.sample(frac = 1) 


, Str(e+1) + "/" + str(epochs)) 


for it in range(int(np.ceil(len(train)/batch size))): 


X train, y_train = get _batch(train, it*batch size, batch size) 


model.train on batch(X train, y_train) 


model.save("Model.h5") 


Training on: 16009 samples 


Validation on: 4901 samples 


Epoch: 1/15 
Epoch: 2/15 
Epoch: 3/15 
Epoch: 4/15 
Epoch: 5/15 
Epoch: 6/15 
Epoch: 7/15 
Epoch: 8/15 
Epoch: 9/15 


Epoch: 10/15 
Epoch: 11/15 
Epoch: 12/15 
Epoch: 13/15 
Epoch: 14/15 
Epoch: 15/15 


LS LS (55 Les pbgal jet a QUI 5h las pô gaill cute Les DS 
LM least LES hs La 
### Test on training set 


batch_ size = 16 


errors = @ 


good _preds 


[] 
bad_preds = [] 
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for it in range(int(np.ceil(len(validate)/batch size))): 


X train, y_ train = get _ batch(validate, it*batch size, batch size) 


result = model.predict(X_ train) 
cla = np.argmax(result, axis=1) 
for idx, res in enumerate(result): 
print("Class:", cla[idx], "- Confidence:", 
np.round(res[claf[idx]],2), "- GT:", y_train[idx]) 
if cla[idx] != y_train[idx]: 


errors = errors + 1 


bad_preds.append([batch_ size*it + idx, cla[idx], 


res[cla[idx]]]) 


else: 


good_preds.append([batch size*it + idx, cla[idx], 


res[cla[idx]]]) 


print("Errors: 
errors)/len(validate),2)) 


#Good predictions 


good _ preds = np.array(good_ preds) 


good _preds = np.array(sorted(good_ preds, key = lambda x: x[2], 


reverse=True)) 


fig=plt.figure(figsize=(16, 16)) 

for i in range(1,6): 
n = int(good preds[i,0]) 
img, 1bl = get_image from number(n, validate) 
img = cv2.cvtColor(img, cv2.COLOR BGR2RGB) 
fig.add_ subplot(1, 5, i) 
plt.imshow(img) 


1b12 = np.array(int(good preds[i,1])).reshape(1,1) 


sample_cnt = list(df.landmark_id).count(1bl) 


plt.title("Label: " + str(lbl) + “\nClassified as: " + 
str(decode label(1b12)) + "\nSamples in class " + str(lbl) + ": 


str(sample cnt)) 


, errors, "Acc:", np.round(188*(len(validate)- 


[eS) 
— 
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plt.axis('off') 


plt.show() 
Label: 355 
Classified as: [355] 
Samples in class 355: 10 
Label: 27 Label: 1924 
Label: 1924 Classified as: [27] Classified as: [1924] 
Classified as: [1924] Samples in class 27: 504 Label: 1924 Samples in class 1924: 944 

Samples in class 1924: 944 


Classified as: [1924] 
Samples in class 1924: 944 
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Chatbot with Machine gg JU lc &o JD Saniol (5 

Learning and Python 
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CChatbot Jos ais 
dl tete ons SN dell lasse Gb pe Chatbot Eole y pole EN 
Li ON 8 LS LS Le pane 55 f pars po M ES Less Co SUN ès 
ns Lo) Le LOL LL Le gage LAS] 135 US Hole poli y 3 vas 
NUL ei I tes «04 phase Hole y eLESY oa LUI Lo sage LE Y 
eg A ji does Gus) po JEUN plésell 4 a ‘intention ul" Les p si 
QE phésus (je lol © y 3) laläts Du, JS 
Jloll Je let bell chatbot gels ou List Reis of gai ce «il 
gags rs LU Loge Al2ë os SLI ox des II os ILE cas axe (SI 
Woo GE N pleall je Jul elfs Doyen gels CAN Wu le Ge 
Dyndécenall 4 3% Le ng5 I Ge NI Lol quels plons éçmedémell pe bleu 
ré di ces db out NI Lis ul mb y of Gel 3e Le 4 plat Gi en 
re dénnall Ju) #15 Lol LI 51 
L De nel osotemell ais ae LIN eè lb ebui chatbot Jaz de LES 
be Lil de Clé es dla Las el es fit SL dsebs dote 
Les pal el QU plat b BI chatbot 55e cuis List eg Cu 
(NV) Lise 6 Cl GUSTS CO Les se Co) SULS eds 
QD! rdc lo yo LU polaiul Chatbot Lil 
pal gr pol ex CS Ji etes AT és ob plie Bols y LES Y 
55e 0 9b due A plage JT dl pe be y LE N Les I les Al 
seb 
tensorflow==2.3.1 
nltk==3.5 
colorama==0.4.3 


numpy==1.18.5 
scikit learn==0.23.2 


Flask==1.1.2 
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Chatbot Llgi 23225 
Cas Lg lb Gas 21 Le pe 2e gas Ua LIN ae dde I OM press 
L ‘intents.json" ul JSON cle LE ele Us 3 JS Le él 550 1 x ce 
JU sel Le CL ja SU ‘a 

{"intents": [ 

{"tag": "greeting", 

"patterns": ["Hi", "Hey", "Is anyone there?", "Hello", “Hay"], 
"responses": ["Hello", "Hi", "Hi there] 
} 
{"tag": "goodbye", 
"patterns": ["Bye", "See you later", "Goodbye"], 
"responses": ["See you later", “Have a nice day", "Bye! Come back 
again] 

} 

{"tag": "thanks", 

"patterns": ["Thanks", "Thank you", "That's helpful", "Thanks for 
the help"], 

"responses": ["Happy to helpl!l", "Any timel!", "My pleasure", "You're 
most welcome!"] 

}: 

{"tag": "about", 

"patterns": ["Who are you?", “What are you?", "Who you are?" ], 
"responses": ["I.m Joana, your bot assistant", “I'm Joana, an 
Artificial Intelligent bot"] 

} 

{"tag": "name", 

"patterns": ["what is your name", “what should I call you", “whats 
your name?"], 

"responses": ["You can call me Joana.", "I'm Joanal!l", "Just call me 
as Joana"] 

}; 

{"tag": "help", 

"patterns": ["Could you help me?", "give me a hand please", "Can you 
help?", "What can you do for me?", "I need a support", "I need a help", 
"support me please"], 

"responses": ["Tell me how can assist you", "Tell me your problem to 


assist you", "Yes Sure, How can I support you"] 


Ug bo QU rie il go JD ao 


}; 

{"tag": "createaccount", 

"patterns": ["I need to create a new account", “how to open a new 
account", "I want to create an account", "can you create an account for 
me", “how to open a new account"], 

"responses": ["You can just easily create a new account from our web 
site", "Just go to our web site and follow the guidelines to create a 
new account"] 

}; 

{"tag": "complaint", 

"patterns": ["have à complaint", "I want to raise a complaint", 
"there is a complaint about a service"], 

"responses": ["Please provide us your complaint in order to assist 


you", "Please mention your complaint, we will reach you and sorry for 


any inconvenience caused"] 


} 


GUÜUUI pui 
sel GNT elle D Ab plièul Hole 9 LASY Lagall oda çye LOU 5 ss has 
USA sul Gb ve ädal de Lt LH LUI Éslell y cop LUI 

45 al pl 
import json 
import numpy as np 
import tensorflow as tf 
from tensorflow import keras 
from tensorflow.keras.models import Sequential 
from tensorflow.keras.layers import Dense, Embedding, 
GlobalAveragePooling1D 
from tensorflow.keras.preprocessing.text import Tokenizer 
from tensorflow.keras.preprocessing.sequence import pad sequences 
from sklearn.preprocessing import LabelEncoder 

gleall Sol lei; JSSON cake À 5 
with open('intents.json') as file: 
data = json.load(file) 


training_sentences = [] 


training_ labels = [] 
labels = [] 


responses = [] 


for intent in data['intents']: 
for pattern in intent['patterns']: 
training_sentences.append(pattern) 
training_labels.append(intent['tag']) 


responses.append(intent|['responses']) 


if intent['tag'] not in labels: 
labels.append(intent['tag']) 


num_ classes = len(labels) 


Scikit- 4e La 55 al label encoder &eual 5 4L elièul I ON ts 


10 #L gLearn 
1bl_ encoder = LabelEncoder() 
1bl_ encoder.fit(training labels) 
training_ labels = 1bl encoder.transform(training labels) 
Tokenization j10 JU] 


pis Bols © $ LS Tokenization &,L put OUI Les I EN ct 
NM els ob 

vocab size = 1000 

embedding_ dim = 16 

max_len = 20 


oov_token = "<OOV>" 


tokenizer = Tokenizer(num words=vocab size, oov_token=oov_token) 
tokenizer.fit_ on texts(training sentences) 

word_index = tokenizer.word_index 

sequences = tokenizer.texts to _ sequences(training_ sentences) 
padded_sequences = pad sequences(sequences, truncating="'post", 


maxlen=max_len) 


Ug bo QU rl il &o JU 22ioII 


SON belle db plis Bols y els les Gien SNS LUI 5 ls OV 
Lu el Bol 5 9 cuil Les ARS + fs Dul ON Lee RS Lo ju 


model = Sequential() 

model.add(Embedding(vocab size, embedding dim, input _ length=max_len)) 
model .add(GlobalAveragePooling1D()) 

model.add(Dense(16, activation='relu')) 

model.add(Dense(16, activation='relu')) 


model .add(Dense(num_ classes, activation='softmax')) 


model.compile(loss="sparse categorical crossentropy', 


optimizer='adam', metrics=['accuracy']) 


model .summary() 
epochs = 500 
history = model.fit(padded sequences, np.array(training_ labels), 


epochs=epochs) 


Layer (type) Output Shape Param # 
enbedding (Embedding) (None, 20, 16) 160% 
global_average_pooling1d (GL (None, 16) 0 

dense (Dense) (None, 16) 272 
dense_1 (Dense) (None, 16) 272 
dense_2 (Dense) (None, 8) 136 


Total params: 16,680 
Trainable params: 16,680 
Non-trainable params: @ 


‘ui J à Au Jo 
Bols y eLa] lee GES (pe dl LI ad of LS Le «5 sel uns Les Ai 
SN ja plaie ge Res > pinll Béss Les NI ls db plie 

Lai Jéall Ginexll 
# to save the trained model 


model.save("chat_model") 
import pickle 


# to save the fitted tokenizer 
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with open('tokenizer.pickle', ‘'wb'}) as handle: 
pickle.dump(tokenizer, handle, protocol=pickle.HIGHEST PROTOCOL) 


# to save the fitted label encoder 

with open('label encoder.pickle', ‘wb'}) as ecn_ file: 
pickle.dump(1bl encoder, ecn file, 

protocol=pickle.HIGHEST PROTOCOL) 


20)! JU lil 839059 ygÙU polis ul Chatbot <Liy 
ge Del paul ps Lune de pèse pe Je Lis ui ab, és oi este 
pénal) Us cènes 60 448 JO Lee J'onaoll à5 Cl La less ae GENI pe 

ten 35 dei [eat En) Gi 
mport json 
import numpy as np 


from tensorflow import keras 


from sklearn.preprocessing import LabelEncoder 


import colorama 
colorama.init() 


from colorama import Fore, Style, Back 


import random 


import pickle 


with open("intents.json") as file: 
data = json.load(file) 


def chat(): 
# load trained model 


model = keras.models.load model('chat_model') 


# load tokenizer object 
with open('tokenizer.pickle', ‘rb') as handle: 


tokenizer = pickle.load(handle) 


ogg JU ici go JD éaniol 


# load label encoder object 
with open('label encoder.pickle', ‘rb') as enc: 


1bl_encoder = pickle.load(enc) 


# parameters 


max_len = 20 


while True: 
print(Fore.LIGHTBLUE EX + "User: " + Style.RESET ALL, end="") 
inp = input() 
if inp.lower() == "quit": 
break 
result = model.predict(keras.preprocessing.sequence.pad_ sequences 
(tokenizer.texts to _ sequences([inp]), 
for i in data['intents']: 
if i['tag'] == tag: 
print(Fore.GREEN + “"ChatBot:" + Style.RESET ALL , 


np.random.choice(i['responses'])) 


# print(Fore.GREEN + "ChatBot:" + 
Style.RESET ALL,random.choice(responses)) 


print(Fore.YELLOW + "Start messaging with the bot (type quit to stop)!" 
+ Style.RESET ALL) 
chat() 


5 0,5 of UT NI ils db plaseel chatboteLésl Le Ses oi A ii a oa 
NI ls db plis Chatbot lil LES J se Wall os ol 
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Title Generator with Machine (ll rlcül &o ulgioil &uio (6 
Learning 

Le » 3 YouTube Je All Al Dis QUL Le sages paul aaal oje à 
Loge) adsl is Us à AI hell peut ail eLE5 Ga piges cu db 
Hs 2 5i youtube 45 Ali dise eLiil 

natural language processing &aub 44 iles dogs Title generator dl sxll ; és 
FX; «text synthesis je Là, ES (le «NI dll ie Judal LS, 225 as 
.conversational systems &sleul äd5|; «speech to text 25 4 

Lt das Le posa con Qu ail pote sf D pall pote Legal p3se5 #1 
GLS Jules Gaël seb A SI plait al ge 

Qi pole il &o ylgichtritio 

HS 51 ul Gb 6 NI belle ob pliguts dise A ge eLESY Lagall aie lulu 
Le Lagall sde Glgotseul I LULU oole sages (LES Res BL cle sages 861,53 
:La 


import pandas as pd 
import string 
import numpy as np 


import json 


from keras.preprocessing.sequence import pad _ sequences 
from keras.layers import Embedding, LSTM, Dense, Dropout 
from keras.preprocessing.text import Tokenizer 

from keras.callbacks import EarlyStopping 

from keras.models import Sequential 


import keras.utils as ku 


import tensorflow as tf 
tf.random.set_seed(2) 

from numpy.random import seed 
seed(1) 


#load all the datasets 
df1 = pd.read_csv('USvideos.csv') 
df2 = pd.read_csv('CAvideos.csv') 


QD loi &o ulgicil{iio 


df3 = pd.read_ csv('GBvideos.csv') 


#load the datasets containing the category names 
datal = json.load(open('US category _id.json')) 
data2 = json.load(open('CA category _id.json')) 
data3 = json.load(open("GB_ category _ id.json')) L 
SN il cs es JA ES LLYI je plissul Q RS > BEL iles SI OI ct 
SI leziles s LL Cads olbe por Gb Lors D snall pére doge Le L ele 
lei Jr tes 
def category _ extractor(data): 
i d = [data['items'][i]['id'] for i in range(len(data['items']))] 
title = [data['items'][i]['snippet']["title"] for i in 
range(len(data['items'l))] 
i d = list(map(int, i d)) 
category = zip(i d, title) 
category = dict(category) 


return category 


#create a new category column by mapping the category names to their 
id 

df1i['category_title'] = 

df1['category_id'].map(category_ extractor(data1)) 
df2['category_title'] = 

df2['category_id'].map(category_ extractor(data2)) 
df3['category_title'] = 

df3['category_id'].map(category_ extractor(data3)) 


#join the dataframes 
df = pd.concat([df1, df2, df3], ignore index=True) 


#drop rows based on duplicate videos 
df = df.drop duplicates('video_ id') 


#collect only titles of entertainment videos 
#feel free to use any category of video that you want 
df[df['category_title'] == ‘Entertainment']['title'] 


entertainment = entertainment.tolist() 


entertainment 
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#remove punctuations and convert text to lowercase 
def clean text(text): 
text = ‘'.join(e for e in text if e not in 


string.punctuation).lower() 


text = text.encode('utf8').decode('ascii', ‘ignore') 


return text 
corpus = [clean text(e) for e in entertainment] 


Cul agi 
-tokens &5eell 556,1 5e de JR GEL Jlés| danddeil At âgelles pes Hs 
.n-gram 8 pee)l 33e dl ce Aube eLé] a DL LES de JM 3 sul 
25 a ls ns Ga 2 ta ne ea 
Si blé sig blé SH OUUS tal 0, of 5e .vocal corpus 
O3 Vall Ge de sommes el 6e àbudes  n-grams ob «lil ska delai ci 
:COrpus 4e gazelle 552 11 Kiel &les ça Tokenization ; sil 
tokenizer = Tokenizer() 
def get_sequence_of_tokens(corpus): 
#get tokens 


tokenizer.fit_on_texts(corpus) 


total words = len(tokenizer.word_ index) + 1 


#convert to sequence of tokens 

input_sequences = [] 

for line in corpus: 

token_ list = tokenizer.texts to sequences([line])[@] 
for i in range(1, len(token list)): 

n_gram sequence = token_ list[:i+1] 


input_sequences.append(n_ gram sequence) 


return input_ sequences, total words 
inp_ sequences, total words = get_ sequence of tokens(corpus) 
sl bell JIbi 53 bof ces dans JT ets 6,6 of Rs et os 
ele Us LS RU Gode Lis 9985 Le êole nant RU pleut de 
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is OUI Bilee oe Va os Le OUI Bulles Aile den Loti a 
ner 

Jebesll Jb Giles Eee e[Jedel Job x xl pus] bé pèeall place EU 
4 paa)l age Louls (0 33e) 3e 9 Duberl e Les p 385 ALI ee (Bus Usb ue 
is deb JsY filling + Ja Sneoll 53e 4 LI e Jos Soul os (eus 
labels 245: predictors 155 LE I et ee 5 

LAS n-gram çys LIU LS Ses n-gram je SUIe eLESL ps5le 


def generate padded_ sequences(input_sequences): 
max_sequence _len = max([len(x) for x in input _sequences]) 
input_sequences = np.array(pad_ sequences(input_sequences, 
maxlen=max_ sequence _len, padding=’pre?)) 
predictors, label = input _sequences[:,:-1], input _ sequences[:, -1] 
label = ku.to categorical(label, num classes = total words) 
return predictors, label, max sequence _len 
predictors, label, max sequence len = 


generate padded_ sequences(inp_ sequences) 


LSTM 83905 


Inputs: outputs: Nonlinearities: Vector operations: 
Input vector Memory from Sigmoid Ge) Element-wise 
current block multiplication 
Memory from Output of Hyperbolic F | Element-wise 
previous block current block tangent —_—__— Summation / 


Concatenation 


Output of 
© previous block Bias: Co) 
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ol 5 4 recurrent neural networks(RNN) 5, Soil ext RU 3 
Je ul le noie cle Seal dll ie of cales NS dis 
Del oeil dat d'enle suall ,225 due Last Lienlt ct Raul GRAN Se 
5,515 AL" Les Jar I Lou I LS dolile Li] LI Ms ç535s de25 
aa LU "memory state 
5 L 'remembers $ks" À 254 Je Je je De RNN Bises bi) kon 
Gi gradient pui Lyse Lai y os dlablse 8 SU DL EN 530 deb 
c£ des 58 ES 
Jde Ge call je pres li je LS ane put Mit LS Sad ie (3 
de g5 pol 5 Salles (Je ALU QUI Globe Luss Ms 4 
(Long Short Term Memory (sil ë 25 4 ,L 8,515) LSTM :RNN 
LSTM &90j &o ylgicil 1190 
Le 5 nes IR Lluf mous als Gui We) Lilo] 4e Le LSTM p5 505 (5 52 
Dig LSTM (so Gel Ce Les 4j ge Los cgutell db DL Gassss = 
EL pi le 5 sil 

JS Si Lis 56 :Inputlayer JV L 

LSTM ia plait PS us :LSTM LL 

-overfitting ll L 5 Lie LES LL :Dropoutlayer Ji il 

A Le Uesmell LIU LOI LJlesst us :Outputlayer LEN EL 
il & Title Generator dalle ce Logol 73505 LI LSTM g5 505 OV pit 
Ji 


def create model(max sequence len, total words): 
input_len = max sequence len — 1 


model = Sequential() 


# Add Input Embedding Layer 
model.add(Embedding(total words, 19, input _length=input_len)) 


# Add Hidden Layer 1 — LSTM Layer 
model.add(LSTM(188)) 
model.add(Dropout(@.1)) 


# Add Output Layer 
model.add(Dense(total words, activation=’softmax”)) 


model.compile(loss=’categorical crossentropy’, optimizer=’adam”) 


QD go ulgicil{iio 


return model 
model = create model(max sequence len, total words) 
model.fit(predictors, label, epochs=20, verbose=5) 
eigoil Juil : JD oc Ül &o ulgicil 90 
pétasse dlsal pres D Ut CNT dll 55e mel 355 ON 
al Ju-5Yl LS 5 = .JU-5Yl LS Le ls Olgall Ro Gt Le das LUI 
sal bell pes all pr sell LI Less pee où HS Jebesll Ja pi 
def generate text(seed text, next _ words, model, max_ sequence _len): 
for __ in range(next_words): 
token_ list = tokenizer.texts to_sequences([seed_ text])[0] 
token_ list = pad_ sequences([token_ list], maxlen=max_ sequence _len- 
1, padding=’pre’) 


predicted = model.predict_classes(token list, verbose=0@) 


output_word = 

for word,index in tokenizer.word_index.items(): 
if index == predicted: 

output_word = word 

break 

seed text += “ “+output_word 


return seed text.title() 


uAglioil 119j 
L obul lat pète pige 2 éclat LES Y Do GLS Li L SI 
( (“spiderman”, 5, model, max sequence _len)) 


Output: Spiderman The Voice 2018 Blind Audition 


phil pleut D all péète 23 sas LES) LES Je Wall oda col 45 0, of ei 
0 fab dus » ds NI 
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Deepfake Detection Job polaàüul ÿLocil ou jUI ous (7 
with Python 
st Decpfake Detection Gexll cas; xës I SAT Ge UlaaJl os 2 
a LS, Deepfake Gel cès;ul 3 Le ge Yes loge es Aù NI cells à ft 
Ale Je ue 5,2 | die cils «Deepfake 52 L ©, Y ES lil. ds 
DL ps Decpfake Gall ès yil CLEST Loges EL Aus of (LS ds ALI 
Ab ls 
U9U rlañul 810) du jUI été 
Cas c0n34he Grnles si ali) ue pt able ge Do je Axis il 
gril Ginnt) URhe Connol s 5 all cl oi Abel pdl ablis CLS 
Je olas Je Ds ,555 Deepfke 445 able of OM 2e es cas Ad 
ha be le Jess will sis où F5 EU (JL, youtubez twitters Facebook 
AS hall le FASY CVs ue 10) LS slésul p3è Facebook 55 «AShall 
HS£l sis Jde Lai Google; Twitter fe seules Jexis 
Gp LAS ge Ua! os (Age Legal ou Granll cel LES] GB JE 
pp GUEST QU uni blé Mes gode Lion ge dal lait le 
bless or dl obesls dell, Lil gel nblèe (ee 
Jocl «lil 8Lo cl ou jUI rt 
ils db list Deepfake Gel cs 5 sine ES LS LES (5 5 NI Les 
ns AM OS sf ul Gb 8 Legal og lulu NI 
import numpy as np 
import matplotlib.pyplot as plt 
import cv2 
import pandas as pd 
import glob2 
import os, fnmatch 
from pathlib import Path 
# import mtcnn 
from mtcnn.mtcnn import MTCNN 
BUS LS ue NL aa Latoi il ablès ge pes Lu of Le nes co 
Gel Col GUEST Loge Des PL I bons Le ge ie Las SV one pe dre lag 
-Deepfake 
pl allie NS Boule) Us LED pl Ge OV 


cristal ane 


def extract_ multiple videos(intput_filenames, image path_infile): 


Extract video files into sequence of images. 
i = 1 # Counter of first video 
# Iterate file names: 

cap = cv2.VideoCapture('your video file path.avi' or 
intput_filenames) 
if (cap.isOpened()== False): 

print("Error opening file") 

# Keep iterating break 

while True: 


ret, frame = cap.read() # Read frame from first video 


if ret: 
cv2.imwrite(os.path.join(image path_infile , str(i) + 
".jpg'), frame) # Write frame to JPEG file (1.jpg, 2.jpg, ...) 
# you can uncomment this line if you want to view them. 
# cv2.imshow('frame', frame) # Display frame for testing 
i += 1 # Advance file counter 
else: 
# Break the interal loop when res status is False. 
break 
cv2.waitKey(50) #Wait 5@msec 
cap.release() 
QI <lcaüul 


extract_multiple videos(fake video name, fake_image path_for_frame) 


extract multiple videos(real_ video name, real_image path_for_ frame) 
Cnil GUEST Loge) Lealass pll ablis del Le LR AI (Las dues ON! 
JS ble JAE 5e Goal cas es Se LES (55 OM Les .Deepfake Gel 


from skimage import measure 
def mse(imageA, imageB): 

# the ‘Mean Squared Error’ between the two images is the 

# sum of the squared difference between the two images; 

# NOTE: the two images must have the same dimension 

err = np.sum((imageA.astype("float") - imageB.astype("float")) ** 
2) 

err /= float(imageA.shape[@] * imageA.shape[1]) 


# return the MSE, the lower the error, the more "similar" 


O1 
CD 
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# the two images are 
return err 
def compare _images(imageA, imageB, title): 
# compute the mean squared error and structural similarity 
# index for the images 


m 


mse(imageA, imageB) 

S = measure.compare ssim(imageA, imageB) 

# setup the figure 

fig = plt.figure(title) 

plt.suptitle("MSE: %.2f, SSIM: %.2fF" % (m, s)) 

# show first image 

ax = fig.add subplot(1, 2, 1) 

plt.imshow(imageA, cmap = plt.cm.gray) 

plt.axis("off") 

# show the second image 

ax = fig.add subplot(1, 2, 2) 

plt.imshow(imageB, cmap = plt.cm.gray) 

plt.axis("off") 

# show the images 

plt.show() . 
le Ge Al poils ON dl ce de penll y all DU el 3 SU 

Copa) on DU (of 5285 ge Be «9 9 ue EN Il Gb pol pt 

Ole fall 


Original Modified 


coll 8 je JE 4e Goal cas lle je CAS Le LR a di 12 a eds 
Deepfake Gall cé; Je Uldll oja cut 5 DS of Chef gels LL 
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Classifÿy Nationalities (JD le Ùl plait Sluial ouai (8 
with Machine Learning 

D pu epréilou plasul, LNT e Cénes LES Je Salbf 05 su Joli ia 
AU LES Jen LS Ha 35 LMI plu ob ces LES GS 
soul 

Cluuia)| ouai 

role Los all pl lol Gb De obedl cames se JM bell ages 153 
ls pl ès sg Les «una Le sf una Life eleul Je £ colceul 
ogoll 


from tensorflow keras 
tensorflow tf 
pandas pd 


os 


re 


da lgstéul I SU le same JS Res LL le page 33 gui Les ON 
ge sel eple LUI le sene sul de DVI La 5e Dig Hall ads 
UNI dal 5 LI cède) esuelus 
male data = pd.read csv(male.csv) 
female data = pd.read csv(femaile.csv) 
repl_list = ['s/o','d/o','w/o','/",'&",'",",'-"] 
def clean data(name): 
name = str(name).lower() 
name = (''.join(i for i in name if ord(i)<128)).strip() 
for repl in repl list: 
name = name.replace(repl," “) 
if ‘'@' in name: 
pos = name.find('@') 
name = name[:pos].strip() 
name = name.split(" ") 
name = " ".join([each.strip() for each in name]) 


return name 


def remove records(merged_ data): 
merged_datal['delete'] = @ 


merged_data.loc[merged_data['name'].str.find('with') != -1, 


O1 
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"delete'] = 1 
merged_data.loc[merged_datal[ 'count_words']>=5, "delete" ]=1 
merged_data.loc[merged_datal[ ‘count_words']==0, "delete" ]=1 
merged_data.loc[merged_data[ 'name'].str.contains(r'\d') == True, 
delete’ ]=1 
cleaned_ data = merged data[merged_ data.delete==0] 
return cleaned_ data 
merged_ data = pd.concat((male data,female data) ,axis=0@) 
merged_datal[ 'name'] = merged_datal['name'].apply(clean_ data) 


merged_datal['count_words'] = merged_datal['name'].str.split().apply(len) 


cleaned_ data = remove records(merged_ data) 

indian_cleaned_ data = cleaned_ data[['name','count_words']].drop duplicates( 
subset="'name",keep="#first") 

indian_cleaned_data['label'] = ‘indian' 


len(indian_cleaned data) 


13754 


LS Cul de JB 4e le Les LIT GEL SYLS NI Us Las des 
.13000 
raléee ble 4e eLoul N 5 Mis .Faker ous Ai ose Sa cul Le eLoub au 


from faker 


‘Brian Evans? 
Jabl os Lies Lung SU Le sage DL SR 25 selon sue ui GUEST Aÿ 


ŒUSS de35 gl eloub clou 0 LES Le Lutgll BL Le sages en et ls 5 
Lai Gin Al 3 plu aol ar I don ce 


non_indian_ data. (@) 
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name count_words 
0 sara gulbrandsen 2 
1 kathryn villarreal 2 
2 jennifer mecormick 2 
3 james eaton 2 
4 melissa bond 2 


ads ON A (ge | 130003 xs Le c«! 14000 Sly Hlboll L el 
Lou plat ol Laney ins 


from keras.utils import to_categorical 

from keras.models import Sequential 

from keras.layers import Dense 

from keras.layers import LSTM 

from keras.preprocessing import sequence 

from keras.preprocessing.text import Tokenizer 

from keras.layers.embeddings import Embedding 

from keras.utils import to_ categorical 

import numpy as np 

from sklearn.preprocessing import LabelEncoder 

from keras.callbacks import Callback 

np.random.seed(42) 

def char _encoded representation(data,tokenizer,vocab_ size,max_len): 
char_index_sentences = tokenizer.texts to sequences(data) 
sequences = [to _ categorical(x, num_ classes=vocab size) for x in 

char_index_sentences] 
X = sequence.pad_ sequences(sequences, maxlen=max_len) 


return X 


def build model(hidden_units,max_len,vocab size): 
model = Sequential() 
model.add(LSTM(hidden_units,input_shape=(max_len,vocab size))) 
model.add(Dense(1, activation=’sigmoid”)) 
model.compile(loss=’binary crossentropy’, optimizer=’adam”, 
metrics=[‘accuracy”]) 
print(model.summary()) 


return model 


O1 
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model = build model1(180,max len,vocab size) 
model.fit(X train, y_ train, epochs=20, 


batch_size=64,callbacks=myCallback(X_test,y_ test)) 


names predictions_Istm_char 
0 lalitha indian 
1 tyson non_indian 
2 shailaja indian 
3 shyamala indian 
4 vishwanathan indian 
5 ramanujam indian 
6 conan non_indian 
7 kryslovsky non_indian 
8 ratnani indian 
9 diego non_indian 
10 kakoli indian 
11 shreyas indian 
12 brayden non_indian 
13 shanon non_indian 


ml SN el pleut Dogs oball cames Le ES Col A bi ee oda 
La pe JO 5 Le 5,5 ei Ses cle GLASS PSI 5 QT or 
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Predict Car Prices with QI oicil &o Gljuuull jou &égi (9 
Machine Learning 
Lt pleut, gl Gluels p5s6 Dot LES Le élubi Gi aol sde 
ES La potsute I SU Le same .PyTorch pleut CN Lol plièul, 
Cl anal (Giles Loi datsoll DL plrul pe Dodge SUL 6e 8,Le bull lus 
8 485 D GA Lécall, das 9, Ge 258 de cb de some (65255 «5 NI 
SL ps ya 
SPyTorch ga Lo 
SPython dk Le pexll xl iles el lys 55 ob gkSe 8 PyTorch 
Det À] es Age LU Ze 4 a DL Goal JEU PyTorch dlxé, ges il 
A les 5f Graal let les CES Gaall he &,e ossi PyTorch &Se 355 
Éb LL Gall il 75bes eLés] JS Sie PyTorch à,» O5 «4 exil 
PyTorch pla iüul cl ul jou 4895 
sel ile .PyTorch plié St eut Gt CNT Lil Loges 1 of 
gi SEL âcseme (DES Je hogell okgl Lez A 5 21 Sal 7 
La (pe Dig Logall oh iles 
import torch 
import jovian 
import torch.nn as nn 
import pandas as pd 
import matplotlib.pyplot as plt 


import torch.nn.functional as F 


from torch.utils.data import DataLoader, TensorDataset, random split 


oGLUI T5 Les ON 
DATA _FILENAME = "car data.csv" 


dataframe raw = pd.read_ csv (DATA FILENAME) 


dataframe raw.head() 
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Car_Name Year Selling Price Present_Price Kms_Driven Fuel Type Seller Type Transmission Owner 


0 ritz 2014 3.35 5.59 27000 Petrol Dealer Manual 
1 sx4 2013 4.75 9.54 43000 Diesel Dealer Manual 
2 ciaz 2017 7.25 9.85 6900 Petrol Dealer Manual 
3 wagonr 2011 2.85 415 5200 Petrol Dealer Manual 
4 swift 2014 4.60 6.87 42450 Diesel Dealer Manual 


® © © © oo 


Dh ent 555 ane Ji tes dleiésul Je 8 ob JRS L5, Ste 
eat lg pas cl ul ef LL 6325 Las pl Gael Ÿ CA steel 
ACER EE 


your name = “"Aman Kharwal" # at least 5 characters 
def customize dataset(dataframe raw, rand_str): 

dataframe = dataframe raw.copy(deep=True) 

# drop some rows 

dataframe = dataframe.sample(int(@.95*len(dataframe)), 
random state=int(ord(rand str[@]))) 


# scale input 


dataframe.Year = dataframe.Year * ord(rand_str[1])/100. 


# scale target 
dataframe.Selling Price = dataframe.Selling Price * 
ord(rand_str[2])/100. 
# drop column 
if ord(rand_ str[3]) % 2 == 
dataframe = dataframe.drop(['Car Name'], axis=1) 


return dataframe 


dataframe = customize dataset(dataframe raw, your name) 


Year Selling_Price Present Price Kms_Driven Fuel_Type Seller Type Transmission Owner 


296 1955.52 10.3550 11.60 33988 Diesel Dealer Manual 
233 1952.61 4.2510 5.70 53000 Diesel Dealer Manual 
15 1955.52 8.4475 10.79 43000 Diesel Dealer Manual 
289 1955.52 11.0199 13.60 10980 Petrol Dealer Manual 
155 1956.49 0.5232 0.51 4300 Petrol Individual Automatic 
dataframe.head() 


SULUI 5,8 Aile IL Lol LS I Res Lél css LS'edtel 


DIN 


0 


© © © oo 


eds (à 


DLL Le sages pliècul LS ES de Alpe AS out motcult oh (JR 


Ru 6 ca eu CEE 


Call; Bel, Dial se le (625 col ane el] LiRes Lol cioeseoll 
ob fus sf Las, sncf Le es (ail 


input_cols = ["Year","Present_Price","kKms Driven","Owner"] 


categorical cols = ["Fuel_ Type","Seller Type","Transmission"] 
output_cols = ["Selling Price"] 


CUUUI pua ni 
ml plis SU Jul, Ke PyTorch paésule cJlèal Lit Gi sie se LS 
PyTorch Ji SLI Les I rl Cul ULAI place Y I NI 
:NumPy ob ées Ji bol a Ds hsl où «Tensors 
def dataframe_ to_arrays(dataframe): 
# Make a copy of the original dataframe 
dataframe1 = dataframe.copy(deep=True) 
# Convert non-numeric categorical columns to numbers 
for col in categorical cols: 
dataframel[col] = 
dataframe1[col].astype('category').cat.codes 
# Extract input & outupts as numpy arrays 
inputs_array = dataframei[input_cols].to numpy() 
targets array = dataframei[output_cols].to numpy() 


return inputs array, targets array 


inputs_array, targets array = dataframe to_arrays(dataframe) 

inputs_array, targets_array 

SU NumPy Sbsbes JIM Jo test Jos oo 3) Sail DIU 6 385 

> ON ob sèes JOLI Dos LES DS) Se LS s ml 2e LE) ce 

LS) oi sal AU phil; «PyTorch lise A less ES co sèeal os 
lee (5e 5 ce QUE à sages 

inputs = torch.Tensor(inputs array) 


targets = torch.Tensor(targets array) 


dataset = TensorDataset(inputs, targets) 
train ds, val ds = random split(dataset, [228, 57]) 
batch_ size = 128 


train _loader = Dataloader(traiïin ds, batch size, shuffle=True) 


val _loader = DataLoader(val ds, batch size) 
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PyTorch &igoi Lui 
I ul, Se PyTorch ele des lost 53 et ele NI 


input_size = len(input_cols) 


output_size = len(output_cols) 


class CarsModel(nn.Module): 
def __init__ (self): 
super().__init__() 
self.linear = nn.Linear(input_ size, output _size) 


# fill this (hint: use input_size & output _ size defined above) 


def forward(self, xb): 
out = self.linear(xb) # fill this 


return out 


def training _step(self, batch): 
inputs, targets = batch 
# Generate predictions 
out = self(inputs) 
# Calcuate loss 
loss = F.11 loss(out, targets) # fill this 


return loss 


def validation step(self, batch): 
inputs, targets = batch 
# Generate predictions 
out = self(inputs) 
# Calculate loss 
loss = F.11 loss(out, targets) # fill this 


return {'val_loss': loss.detach()} 


def validation epoch_ end(self, outputs): 
batch_losses = [x['val_ loss'] for x in outputs] 
epoch_ loss = torch.stack(batch losses).mean() # Combine 
losses 


return {'val_loss': epoch loss.item()} 


QI roll &o il ul Jlouvi 6695 


def epoch_ end(self, epoch, result, num_epochs): 
# Print result every 20th epoch 
if (epoch+1) % 20 == @ or epoch == num epochs-1: 
print("Epoch [{}], val_ loss: {:.4f}".format(epoch+1, 
result['val loss'])) 


model = CarsModel() 


list(model.parameters()) 
ei os pliée LU mouse Ai nn.Linear WI cos cool AU oi 
DU Ke FI loss Di plu 5, lil Ole ge DV Res La 
est de hand ue 5 sillage eds 054 Jabel lÉss 55 dÙ F1 loss 
pal Le es Ju Y 2 S 
lui oil Gui eigoil Quai 
Ales ça (6 CN Bd A PLAIT dass ca A 28 4e 8 LI US I ON Los 
D pe 6 Lg) 


# Eval algorithm 
def evaluate(model, val _loader): 
outputs = [model.validation step(batch) for batch in val _loader] 


return model.validation epoch_ end(outputs) 


# Fitting algorithm 
def fit(epochs, 1r, model, train loader, val loader, 
opt_func=torch.optim.SGD): 
history = [] 
optimizer = opt_func(model.parameters(), 1r) 
for epoch in range(epochs): 
# Training Phase 
for batch in train loader: 
loss = model.training_ step(batch) 
loss.backward() 
optimizer.step() 
optimizer.zero grad() 
# Validation phase 
result = evaluate(model, val _loader) 
model.epoch_ end(epoch, result, epochs) 


history.append(result) 
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return history 


# Check the initial value that val _ loss have 
result = evaluate(model, val _loader) 


print(result) 


{'val_loss': 2300.039306640625} 


# Start with the Fitting 
epochs = 90 
ir = 1e-8 


history1 = fit(epochs, 1r, model, train loader, val _ loader) 


Epoch [28], val_ loss: 1692.0131 
Epoch [40], val_ loss: 1119.7253 
Epoch [68], val _ loss: 638.9708 
Epoch [80], val _ loss: 357.3529 
Epoch [98], val_ loss: 317.1693 


# Train repeatdly until have a ‘good' val loss 
epochs = 20 
1r = 1e-9 


history1 = fit(epochs, 1r, model, train loader, val loader) 


Epoch [20], val_ loss: 7.9774 


Jo pèse nil ea El 25 sal Jos pli es coll Je se «035 LS 
Ds pue co til Jesse plièul «SGD ss Al as le West ode (Gc call 
8j LS) LS JS Ans nds 5 VS das lens ele ls 8 Lusel| 
Clust uv Gui ego pla 
AN SN Il phaéeul 3922 El dde SUR sell A I tes dsl 
cd PE sl Cars SES UE «LUI de jogs dll sl Ju-5Yl ri D pK 
el oui 5 Lis JS pe EU ed (53938 pSseill os all del ge 
# Prediction Algorithm 
def predict_single(input, target, model): 
inputs = input.unsqueeze(@) 
predictions = model(inputs) # fill this 
prediction = predictions[@].detach() 
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print("Input:", input) 
print("Target:", target) 


print("Prediction:", prediction) 


# Testing the model with some samples 
input, target = val ds[e@] 


predict_ single(input, target, model) 

Input: tensor([1.9565e+03, 1.7800e+00, 4.0000e+03, @.0008e+00]) 
Target: tensor([1.7985]) 

Prediction: tensor([1.4945]) 


28 Los âgés Less Anis us esrall Gagne Me 4 opel üÙ «55 LS 
ES ail 59 27 (gd 4 ja Une ls JL Ses DVI Ie pue 23 52 

input, target = val ds[10] 

predict_ single(input, target, model) 

Input: tensor([1.9555e+03, 8.4000e+00, 1.2000e+04, 0.0008e+00]) 


Target: tensor([6.9768]) 
Prediction: tensor([-8.4869]) 


SN bel pliée, SI Jeu ge LES Joe Well eds cet 45 5,6 of (Hi 
-PyTorch pleut D all dsl SN 5 505 plseuls 
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Predict Fuel (QI ricil piañiuul 2989! ô-lè4 &ë95 (10 
Efficiency with Machine Learning 

Lol le pins Bol II DA 05 I Be BU SN el Se e pla os 3 
Sgel BUS GI LAS Le ble lol oje GAIN of aus he «8 Lermall 
QI hell JO pe 3 5 9 

2969) 6:18 49) 

QU Le pages NI bell oules on El LUI le pages code] La paéerute 
Jos but UT OLIS, AU 535 I Sgul S EN 550 LEY Auto MPG 
ges 8 où pe ol 5e al cos ph sal sage tés PLU ste 
ile» 8, 14l; displacement à-{;Yl; cylinders blu fee clean ca ji Ki 
-weight à; Jl; horsepower 

ogoll ojg s AU &,9,21l lea 5, ui Les 


matplotlib.pyplot plt 
pandas pd 
seaborn sns 


tensorflow tf 
from tensorflow keras 


from tensorflow.keras layers 

2e pas Je35 Usges ÉLSes LL Lo sages 55 58 dla ces GUN JE 6 AI «NI 
pandas da5> pleut OL» ç5 Les OL ie LUI 

column_names 


] 


LA LA 
, hames=column_ names, 
na_values , comment ; 


sep , Skipinitialspace ) 


plssul Tes 6 al Ml «categorical çs 53 SLI &e see (G'origin" 3525 
:one-hot encoding :,+LuI Le il 2x 
origin = dataset. (( ) 


dataset[ ] = (origin ) 
dataset[ ] = (origin 


dataset[ ] = (origin 


Ds Los le page I OUI M Les ol 
train dataset = dataset.sample (frac=0.8,random state=0) 
test _dataset = dataset.drop(train dataset.index) 
SU Je Les NI Let JO ce 9 8 1 Sul BUS Ge SENS Cul (5 
seaborn Zls5l ä, L plis 


sns.pairplot(train_dataset[["MPG", "Cylinders", "Displacement", 
"Weight"]], diag_kind="kde") 


EE 2 Lam aa OUI Lo some 359 2 all opel 6 Bagues a aile OS 
8 J1EUR pl le 75 sel Ces Lpaéule (21 3 all 


train _ Labels = train dataset.pop('MPG') 


test_labels = test_dataset.pop('MPG') 


SÜUUI àyguus 


pl de té CBS, le pas dl ol all standardize 4-55 (05 
Jess Lyme ST Lil James ls of M el el or D5s Dylés of 8e 5 set of 
EL pLal dl gts bol Ginall CUS LA de Loir el ci 
rade cor 6 SU m5 NT EL de some 2 34) 
def norm(x) 
(x - train_stats['mean']) / train_stats['std'] 


normed_train_ data = norm(train_ dataset) 


normed_test_data = norm(test_dataset) 


gt de ef pdt 3 sel CIN Lada 5 NT SUN a de (CG olal 
past Je 5 La) SU I LS YL Les I Ge sages ES ais ae à Les (sil 
EN des sil 
eagojl <Lu 
8 9 cils pd ob ee gel APT piste ba Les ses 2 Les 
UN (2 «build model .üls del LU) LS CUS pu ls Lei dre 
GY GU 5 se pie 
def build model() 
model keras.Sequential([ 
layers.Dense(64, activation=tf.nn.relu, 
input_shape=[len(train dataset.keys())]), 
layers.Dense(64, activation=tf.nn.relu), 


layers.Dense(1) 


D) 
optimizer = tf.keras.optimizers.RMSprop( 


model.compile(loss='mean_squared error', 
optimizer-optimizer, 
metrics=['mean_ absolute _error', ‘mean squared_error']) 
model 
model = build model() 


model.summary() 


Layer (type) Output Shape Param # 
dense_1 (Dense) (None, 64) 4169 
dense_2 (Dense) (None, 1) 65 


Total params: 4,865 
Trainable params: 4,865 


Non-trainable params: @ 


ON péoll out Be b sel Li qua Les 05 BR EU Phsel Cuu6 (LS OM 
example batch = normed train data[:10] 
example result = model.predict (example batch) 


example result 


@.12670723], 
0.03443428], 
0.3062502 ], 
0.3065169 ], 
0.36841604], 
@.02191851], 
0.3674207 ], 
0.10561748], 
0.00638346], 
0.00226454]], dtype=float32) 


ag8gJi 6 -Là4y Gui egoil ua 
DES 5 le 3 sel D Les OI 


class PrintDot (keras.callbacks.Callback): 
cerRontepochæendise br Mepoch Mo sS)E 
1e Corot MIO ENT) joritiote (0) 
Site (lol ice | 


EPOCHS = 1000 


history = model.fit( 
normed train data, train labels, 
EpPOChsSREOCRS valide tionsspE 02 verbpose 0} 


callbacks=[PrintDot()]) 


ll os y Se Les OI 


(history) 
(history.history) 
history.epoch 


.plot(histl'epoch'], hist['val_mean_absolute_error'], 
Label Val Eraciet) 
.ylim([0,5]) 
plt.legend() 


.figure() 

plt.xlabel('Epoch') 

plt.ylabel('Mean Square Error [$MPG"2$]') 

plt.plot(hist['epoch'], hist['mean_ squared_error'], 
Label='Train Error') 

plt.plot(hist['epoch'], hist['val_mean_squared_error'], 
Label WalMError) 

plt.ylim([0,20]) 

plt.legend() 

plt.shouw() 


plot_history(history) 


Li 100 Ji de Gill les Go gas 2e sf Goal Css ebiof LUI pou I Le Jos 
Gé de uses Ÿ Le Conti GZY modelfit i,L dus Les OV 
Je je 5 La JS Ca AL Le (sl EarlyStopping Sail 35 sl pas 

GU6 Li SUEL 65 cues Je) Ds lai ja uns 
model = build model() 


# The patience parameter is the amount of epochs to check for 
improvement 
early_stop Keras.callbacks.EarlyStopping(monitor='val loss', 
patience=10) 


history = model.fit(normed_ train data, train Labels, epochs=EPOCHS, 
validation split , verbose=0, 
callbacks=[early_ stop, PrintDot()]) 


plot_history(history) 
Se #/- 2 Je le Digi buse GS cpiell een Gel CU ul moe 
EN a ia Su Sie a (fa 0 
Dos die Leeds à ls EN Le some plu 73 seul pres ps RS 5 5 Use 
pu Gas Lie Le of 25sell Me de 5S de gi LI a més ei sill 
are) 


loss, mae, mse model.evaluate(normed_test_ data, test_Labels, 


("Testing set Mean Abs Error: {:5.2f} MPG".format(mae)) 


Testing set Mean Abs Error: 1.97 MPG 


DEN EUS EU sell le Es Lu Les OM 


test_predictions = model.predict(normed_test_data).flatten() 
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oi rl ,$ TensorFlow plat ail pin 


PS ll, STensorFlow ph al cètes 75 se El pate all ois 
me ES 9 QE Cane HS ed art all es plaèeuls Ldle sf Golf Lesf Le 
NN plat JS ce pouls OU de Ge (RU: 


.Kerass TensorFlow Hub peut Al JE gala al Le Élés-sls 
ŒUL seb 5e ki 50000 Gomes de 525 A IMDB SÙLS Le gages plate 
ce LU és 25000, Le as 25000 (A Loue Las es NT de PUY 
Laleu M lat all ce lune ue le css du les UE Vls cu juil cle sages 53134 

‘ At, 
sl Gb 5e TensorFlow plis ail cat Lolll Logoll oigs fu ol 
5 2 SSI 2x 


import numpy as np 


import tensorflow as tf 


Ipip install tensorflow-hub 

Ipip install tensorflow-datasets 
import tensorflow_hub as hub 
import tensorflow datasets as tfds 


( , tf.__version_ ) 
( n Elo ()) 
( 


, hub.__ version __) 
( ; af 
tf.config.experimental. 


) 


AIN es SN Le (EU de Le lyatul Cl SULI Le some Of 5e él Le 
des lies ul Sbf 2e La .TensorFlow pliée SLI bases blue à 35 
A lots QUI Jones bles poêle ON Le ylé pole col ce URI Le sage 

less ut le some 
# Split the training set into 60% and 40%, so we'll end up with 15,000 


examples 
# for training, 10,000 examples for validation and 25,000 examples for 


testing. 

train _ data, validation data, test _data = tfds. ( 
name= , 
split=( : 
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GbUUUI SUi4 ul 


ge De 10 Jai tele, LL ps5lu sans Jaune Le 30) OGLUI le 515 Ab Lies 
UN e some 


train examples batch, train labels batch = 
next(iter(train data.batch(10))) 


train examples batch 


DU Le sages je ben 10 Ji pd Les «0ŸI 


train labels. batch 


Output: <tf.Tensor: shape=(10,), dtype=int64, numpy=array([0, 0, 0, 1, 1, 1, 0, 0, 0, 0])> 
JAÏl Quoi 639.0) li 
a) à5 Less pt .TensorFlow plisl all ces Loge) 75505 #1 
2 Ni Les .TensorFlow Hub ct D ,x (sl, TensorFlow :;+ Lie Gns 
cils x dleprss deal Lezll STensorFlow Hub goss piéces Keras ib 
:JU-s5Yl 
embedding 


hub_layer = hub (embedding, input_shape=[], 


dtype=tf , trainable 


(train_examples_batch[:3]) 


SI SU Le same de pSsllele à5 ON 


tf.keras. 
(hub_layer) 


(tf.keras.layers. (16, activation 


(tf.keras.layers. (CG) 


() 


Model: "sequential" 


Layer (type) Output Shape Param # 
Cd Ne DU. 
dense (Dense) (None, 16) 336 
dense_1 (Dense) (None, 1) n7 


Total params: 400,373 
Trainable params: 400,373 


Non-trainable params: @ 


Qlretcül,s TensorFlow piaAiul ail äyini 


€2goil &Lo ni 
es CPS 9 8 plus jl 4ls plaèeul il compile tes este ON 


model (optimizer 


loss=-tf.keras.losses (from _logits 
metrics=[ 1) 


al ni 8 19-03 LU Jai 
er Le BUS 20 os ce 512 5 8 Re Ole sage Dia 20 Bd 5 saidl Con 
2e 10000 Le 4353 75 sad las Cl, «nil eléf y train ; x train sell le 
na)l js Giro de same 5 


history = model.fit(train data.shuffle(10000) .batch(512), 
epochs=20, 
validation data=validation data.batch(512), 
verbose=1) 


&2g-oJl muüi 
243919 8 plus Dire Ces toile Jai er cr Jen cases Less 


results model (test_data ( ), verbose=2) 


name, Value in (model.metrics_ names, results): 
( (name, value)) 


49/49 - 3s - loss: 0.3217 - accuracy: 0.8553 
loss: 0.322 
accuracy: 0.855 


Rés ME qu se 325 Ball 85 Les 15 June Lu Lau ès rise Gi EU 
put Gall Que pis Jo Jlèdl Me cel 4 0, of Gel pe 


.TensorFlow 
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Image QI ricül 48 TensorFlow plañüul Jgal ini (12 
Classification with TensorFlow in Machine Learning 

opel Lite Loge) Lnaull II pis Con Se LS 7 él lil sde (3 
TensorFlow ob .TensorFlow Le 221 SE SN ZLL .TensorFlow plis 
ge dps Jelé es pl Lou NT LU lis phaall à nés coulul els Le 3 Le 
AN Les Géo sleall l, I 4 ALU ons Al mermeell 5 hge 3 eS oil 3 el ps 
D peus JN Qi Le Rasa El 455 #03 3 lee 


SJgal ain ga Lo 

GS, Li5e dorsal ji JR oole sage Lauds tes Las ge 5) all Les 
gailesll oe ST its LE pleut Latest à 51 ons Ses iooee delà 
textural SI 5| spectral à il 


TensorFlow pol1àiüul Jgual ini 
Scikit-Learn ss dotis Mis JA ils plassut y pe Cène el 2 Lai es ON 
ei logo) Las IS LU LS «TensorFlow put .TensorFlow lili 
lu css 5e je ST Las Lasf GLEST LR Las 28 ele JE je all 
oo 2) pl 2e ul Gb de TensorFlow pleut y all cèes Loges ON 


# TensorFlow and tf.keras 
import tensorflow tf 
from tensorflow import keras 


# Helper libraries 
import numpy np 
import matplotlib.pyplot plt 


Fashion MNIST blu àc go al pui 
QU _ SNS MNIST SU, Ge sem) y DAS Fashion MNIST es 45 
US Lo goes 6 005 Lo suloull &5 0 NI ll mel. “Hello, World” Spies Le 
gr oll jpel les Genis Gel 42 1 40) A de LES eBN pe le MNIST 
-TensorFlow plié sell Las Legal liés SI 

fashion mnist = keras.datasets.fashion mnist 


(train _ images, train labels), (test images, test labels) = 
fashion mnist.load data() 


Jlricül à TensorFlow plaäiul à jguall gi 


DU ele Joss pal IL Eds Lans Gino JS one pis aol Lo some 5 
Dyel Ge de GAY ledit je Ras 2e La Les VI plis ob de page 


class_names [ 


CU &aJco 
Je Go LUI Elles Lan TensorFlow pléub os y pell cas Loge) ALL 
pi OÙ dau el Logo) JS LMI pat, ca 1j Lan SI ou 


12555 0 ss SI 


(@) 


(train_images[0]) 


() 
(False) 


(@) 


8 
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SU pie Jens JS 1 Ji 0 5e Ge LI ll ele Ge pus DVI pile 
DepAil de sages dlles Lou 255 de ll dus I gloss céllh LAN ieall 
Len Le DL SN e goes s 


train images = train images / 255.0 
test _ images = test images / 255.0 


SN Conde eLEN boleul ge Giles modll Gil SUUI of ge Gil 
eue Jr de see) 850 25 js QU pd « TensorFlow pin ) pail ès Aox)l 
ge JS Es all ul ox 
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(figsize=(10,10)) 


( 


(25e 
(5,594) 
([]) 
([]) 
) 


(train_images[il], 
(class_names[train_labels[i]]) 


cmap=plt.cm.binary) 


b 


Ankle boot 


5) 


Pullover 


T 


Fshirt/top 


$ 


Ankle boot 


Fshirt/top 


PL. 


Sneaker 


pi 


Ankle boot 


Li 


Touser 


Ex 
= 


Tshirttop 


Pullover 


Sandal 


T 


Fshirt/top 


Fshirt/top 


Dress TFouser 
€à9.j «li :TensorFlow plais jgal oui 
cat A ct «TensorFlow plisul ; all Lèses ogol &axll PAS | 


<> il compiling mes EE ga épis SEL 135 


Jul à TensorFlow plaäiul à jguall àgini 


Clérhil alac| 

ge Hal pli Gb de I es Lab ee Lea DRE LL I 
Le des OL (Jason plait creall el line Les Lle-sl eu CAN LOI 
nan RS CL +85 Les OV 


model = keras. (LC 
keras.layers. (input_shape=( 
keras.layers. ( , activation 


keras.layers. (10) 


1) 
@3goil &Loni 
Les 3o5 compiling #25 EURE 2 Les OI 
model (optimizer 


loss=-tf.keras.losses (from _logits 
metrics=[ 


€gigoil eu 25 :TensorFlow poli Jgl oxini 
35 «TensorFlow plu el Cène loge le Leaall SN O6 Les «off 
lle ol Jen 


#Fitting the Model 
model (train_images, train_ labels, epochs ) 
#Evaluating Accuracy 
test_Loss, test_acc model (test_images, test _ Labels, 
verbose=2) 
(( , test_acc) 


Test accuracy: 0.8817999958992004 


HMake Predictions 
probability model = tf.keras ([model, 


tf.keras.layers 
predictions = probability_ model (test_images) 
predictions[0] 


array([1.1349098e-09, 1.0395625e-09, 3.4154518e-10, 8.3033120e-12, 
6.5739442e-10, 5.9645530e-03, 9.4151291e-09, 1.1747092e-02, 
8.7000714e-08, 9.8228824e-01], dtype=float32) 


Us 5)p0ll di dei rl "confidence 4" Jess él Bi 10 ;ys dyèos ça és 
Le del de see Gil anal (65 Les Aèléeoll 8 ut pu Doll 2 JS a 
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np.argmax(predictions[0]) 


Output: 9 


si ankle boot JU «li 14e 85e 5) pll ojs où Bla ee gpl 0 «JUL, 
gone al Ne D] NI Lens ai nés class names [9] 


test _labels[0] 


LUS 5 pu saelue As LE 65e NI 


def plot_image(i, predictions_ array, true Label, img): 
true label, img = true Labell[i], imgli] 
plt.grid( ) 
plt.xticks([]) 
plt.yticks([]) 


plt.imshow(img, cmap=plt.cm.binary) 


predicted_label = np.argmax(predictions array) 
predicted_ Label true _ label: 
color blue‘ 


color ‘red' 


plt.xlabel("{} {:2.0f}% ({})".format(class_ names[predicted_ label], 
np.max(predictions_array), 
class _names[true label]), 
color=color) 


def plot_ value array(i, predictions array, true _ Label): 
true label = true label[il] 
plt.grid( ) 
plt.xticks(range(10)) 
plt.yticks([]) 
thisplot = plt.bar(range(10), predictions array, color="#777777") 
plt.ylim([0, 1]) 
predicted_label = np.argmax(predictions array) 


thisplot[predicted_label].set_color('red') 
thisplot[true label].set_color('blue') 


Jul à TensorFlow plaäiul à jguall gi 


HiguiUl po Gén UI 
Dyb eomall pl bas pol Jones be gl de O NI Cle 85 CAE Use 
a db Ball él oules GI 


i = 0 

plt.fiqgqure(figsize=(6,3)) 

iles smlosilot (il, 211) 

plotimagete = prechenionsriP Ces EME Re "Ces Cmimeges)) 
piles smanilot (il, 2 2) 

plotsvatuessrMt M predieEMonS IAE SCT RES) 
plt.show() 


Ankle boot 98% (Ankle boot 
bo sun, 0123456789 


# Plot the first X test images, their predicted Labels, and the true 
labels. 

# Color correct predictions in blue and incorrect predictions in red. 
num_rows 

num_cols 

num_images = num_rowsxnum_cols 

plt (Figsize=( num_cols, num_rows )) 


“ in (num_images ): 
plt (num_rows, num_cols, i+1) 
(i, predictions{i]l, test_Llabels, test_images) 
(num_rows, num_cols, i+2) 
(i, predictions[i], test_ Labels) 
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Ankle boot 98% (Ankle boot) 5133326789 Pullover 100% (Pullover) 0123480780 ouser 100% (Trouser) 0123456789 


Touser 100% (Trouser) Shirt 74% (Shirt) Touser 100% (Trouser) 


0123456789 0123456789 0123456789 


ne 


Cost 36% (Cont) 0123456789 bd nd 0123456789 SE nn 0123456789 
Sneaker 100% (Sneaker) 5133250700 Coat 89% (Coat) RENNES Sandal 100% (Sandal) RRRPRET ET 


CS 


Sandal 87% (Sneaker) REVERS Dress 100% (Dress) PRESTTRETU Coat 77% (Coat} 0123456789 


Df Jef Joluall Je LL Lei Le Lee Gel eu al LiNI Lis sl, El he 
NI hi pleut à nel Cène Jp Wal ae ol 35 0 6 


PyTorch plu JbipatcUl polañiu Joual Le «à pri 


PyTorch pla JU lc plais Jgal Qi Seul (13 
Image Recognition with Machine Learning using PyTorch 
Ge Qoues Eaall aie HS SL some sf ae CIS ge an der çof Lil Jde 
Je Se QUI 4e JR Lnneot Los au ON 25 Algu 345 See 3 es 
Sr à el 3e NI bel pléuts jai Le oeil de Ales We af. ll 
LU Je SAT Ge «Ua os optical character recognition > Je 
-PyTorch pleut NI ll sut ) nel le Sail 75 5 

SPyTorch ga Lo 

æln LS a PyTorch. PyTorch ss L A ré Les clogoll sis Goal 3 
PyTorch 555 cägue WU OL ie exil hell 75las LT Jeudl de Ja CA ob 
dela Gls ass lé PyTorch caf aus el Gel all al, Zeus 
goal pli As all Je Jan igall lBL I 


D plc plat Jgal QLe 6 pe 

&55 A TorchVision 43» pxrule Alaall ele Gupall, Le cel age) Lu 
x LS .AlexNet Je cs pull 45,1 élof Deal CRI ù Mail se x Le 
SU GI sdeluall &ls3Vls ImageNet Je SEL le same QI Su Vos 
-PyTorch sell 45) els de 


-torchvision.models Aus Bdoeall Fill Le ; sal Ses 


from torchvision models 


(models) 


l'AlexNet', 
"'DenseNet', 
'GoogLeNet', 
‘GoogLeNetOutputs, 
‘Inception3', 
‘InceptionOutputs, 
‘MNASNet', 
"MobileNetV2", 
"ResNet', 
"ShuffleNetV2', 
'SqueezeNet', 

"VGG", 
"_GoogLeNetOutputs', 
"_InceptionOutputs, 
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"_builtins_, 
"_cached_, 


__doc_ ", 
"_file ’, 


"_loader_, 
'_name_ 


us | 


"_package_, 
"_path 

'_spec_, 

_utils”, 

'alexnet', 

'densenet', 
'densenet121', 
'densenet161', 
'densenet169", 
'densenet201', 
"detection, 
‘googlenet", 
‘inception', 
‘inception_v3", 
"mnasnet', 
"mnasnetO 5", 
"mnasnet0O_75", 
"mnasnet1_0", 
"mnasnet1_3", 
"mobilenet", 
"mobilenet_v2', 
‘quantization', 
'resnet', 

‘resnet101', 
‘resnet152", 
‘resnet18", 
‘resnet34", 
‘resnet50", 
‘resnext101_32x8d', 
‘resnext50_32x4d', 
‘'segmentation', 
'shufflenet_v2_ x0_5, 
'shufflenet_v2_x1_0", 
'shufflenet_v2 x1_5", 
'shufflenet_v2_x2_0", 
'shufflenetv2", 
'squeezenet", 
‘squeezenet1_0", 
‘squeezenet1_1', 
'utils', 

V88', 

veg11, 

‘vgg11_bn', 


0 | 


PyTorch plaäiul JUrtcülplaiu Josall (te «à ci 


‘vgg19_bn', 

‘video, 

‘wide _resnet101_2', 
‘wide_resnet50_2"| 


A Eee ele LUE SI ne Aall LES CA DL ot I ST el 25 
ge diese cle pages CLS cal os pe Le (te bi! 5 Dlf axé Go dis 


«lle 

AlexNet 

pl LES La .AlexNet &ù (Lie eLil LS «Jolie de AlexNet £s Lai 
tOIne 


alexnet = models.AlexNet() 


pe DU RU (655 all os el AlexNet if js if 5» alexnet cäle el oi 5 
Sos pas GE 3m AlexNet 4% 4 Ji 5 ÿl Gel a GES vds Joli 
HIS aout 


8 pôle JE el po Ut dl OS Jess) SEL, alexnet 435 JE ce 
As deal LAS os NI Le sagall JE pe dell parus ET aa AS 
AU el Ge dnaall LOI pe LIEN 4e pageall II Lesle 35e JE pr 


ResNet 
101 5e BR LAID Les AUS Li LS] NI LS cresnet101 à L pisse 
Dai Jay ANT ge Cle eus pes poire ASCEU (Lhe eléol ON Las al 
1000, 5,5 date 1.2 pe cImageNet SU, 4e some le LS 5 2 resnet101 


ds 


resnet = models (pretrained ) 
resnet 


etes els pes EU PL 5e Ka JS AUS resnet sx eledrul 5 $es OI 
ré GOUND Léo 5 pronall paul, 536 vos JS y gel Liall eullaoll 
35 Si torchvision 5%; pl LI tes bi LU LS Lay GUl 

he pus Loulu VI LIN Lilaoll Jlss obl Lola does L parte ls D Y pour 


from torchvision transforms 
preprocess = transforms.Compose([ 
transforms.Resize( ), 


transforms.CenterCrop( ne 
transforms.ToTensor(), 
transforms.Normalize( 
mean=[ * 
std=[ ; 
)1) 


x 256 9 JUS Yl is als Le (see AI Rball auleoll lo Lou Hleul ads 3 

SES bonds 59 dl ebsess 63S all J> 224 x 224 415, all Lola, 256 

OÙ ls sons Lilus Lou DS en (oi nest cel) Le LLUIRGB 
bre 


Jai le cl 
Sp Cal Ji Les so pl , all Le à x Logo] 5,50 plisul ES ON 
es 25 «Pillow plié (dsl lil el ge 7e (ess se Ses SI 
D AU pal Blu 
from google.colab files 
uploaded = files.upload() 


PRIE Image 
Image.open("dog.png") 


Ÿ =, 


x 


Dal Le © 2 Lu Bell Laulleal QT Le Le 3e) 5 I pts «lis des 


img_t = preprocess(img) 


SAN Lens 55 (MN de el ages 3 lets; JM 55e RES 51e] I LR 


PyTorch polaäiul JUptcülplaiul Joaail (te «à ci 


torch 
torch. 


©) 


(batch_t) 


Jgl Lie «8 cl 8àg0j Jui 
ail 35155 ginference JYa NY ste SLR (Le je goss Jeris les oi 
HSM ès Ji los dde )sell de Gal 755 oltisul (lee Ji de «Graal 
Le gas OS Cène 1000 (de (552 GI lait (bass OV LE ll ès 


: ImageNet= LL 
( 
labels = [line. () 
PMindexe=arorche (out, 
percentage = torch.nn.functional. dim=1)[0] 


Labels[index[0]], percentagelindex[0]]. 


(‘golden retriever’, 96.29334259033203) 


girl ob ALI cd 6155 Gesell Sos I LUI LS at, RAA TANT 
hp) dr LS 58 ai] Es Le of à, &f 7/96 aus AS 


past NI lt placent y all de il Joe Wall oje col 35 0456 of (al 
-PyTorch 


ee) 
O1 
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Fashion Recommendation System <U ji äuagi poli (14 

GI NI pb plu LU Les els LE LAS Le EL Ge laut eds (3 
oi ée 5 all LS 5 LU Loesoll 5 SM Le Globe 5 Je Jess 
-recommendation system à sl els sal, 


De ES légume je Mall Gébaiol LLAL LU mess pli se Lol pli 
pelall plèsul 3 pie) CASE sut LU Loi 5 Je «le segell 
Le se LU Les QU es des Ji ll scpedéall gaz JE ce 2 SSI 
LiL 1 3 D, QUE galls pl salle LMI Ge laut 152S Lacs LU ESCelus oo 
DÉS L Les Lay sr Je 

JD roc &o <UjiU ämagi ol 

ei Soil Le La UNI Le 5 ds of au Nas Yet Se Le 
Jar AN Gible Lg ddl) Lei Ja sal ” Ji Hs . ont Gludl LL: s 
SUXS, el Gall lp je of ces Je ST LU Les eus Lil 
Al olgz ls Call dos s 25 sal Lol 5 


er let D te NI dl plait LU Ge 5 pU 4, DOS Ga Les 
ogall oigl Lez ci pol 


matplotlib.pyplot plt 
numpy np 
pandas pd 
gdown 
fastai.vision 


fastai.metrics accuracy, top_k_accuracy 
annoy AnnoyIndex 
zipfile 
time 
google.colab drive 
matplotlib inline 


URL ste Gal des «tb el google 4,2 DU mer I tes VI 
ÉLES 35 YA de so pere | Lis, og) 
# get the meta data 


url 
output 


gdown.download(url, output, quiet 


urt 
output 


<Ujiu äuagi ouai 


gdown.download(url, output, quiet=False) 
url = ‘'https://drive.google.com/uc?id=0B7EVK8rQ@v/71pdS1FMlNreEwtc1E" 


output = ‘list_eval_partition.txt' 
gdown.download(url, output, quiet=False) 


LL bi google 4, 4e ) hall az de (has Les ON 


# get the images 
root_path = './' 
url = 
‘https://drive.google.com/uc?id=1j5fCPgh0OgnY6v7ChkWlgnnHH6unxuAbb" 
output = ‘'img.zip' 
gdown.download(url, output, quiet=False) 
with zipfile.ZipFile("img.zip","r"}) as zip_ref: 
zip _ref.extractall(root_path) 


dés LL SU LE SUN CBS, QUUN ass oies ès ON p 35 


category_List = [] 
image_path_list = 
data_type List = [ 
# category names 
with open('list_category_cloth.txt', 'r') as f: 
for i, line in enumerate(f.readlines()): 
ab D Ea Le 
category _list.append(line.split(' ')[@]) 


[] 
] 


# category map 
with open('list_category_img.txt', ‘r'}) as f: 
for i, line in enumerate(f.readlines()): 
1e 1 Es 112 
image_path_list.append([word.strip() for word in 
Line.split(' ‘) if Len(word) 5gt; 0]) 


# train, valid, test 
with open('list_eval_partition.txt', ‘'r') as f: 
for i, line in enumerate(f.readlines()): 
1 ES Le 
data_type_ list.append([word.strip() for word in 
Line.split(' ‘) if Len(word) 5gt; 0]) 


data_df = pd.DataFrame(image path _ list, columns=['image path', 
"category_number']) 

data_df['category_number'] = data_df['category number'].astype(int) 
data_df = data_df.merge(pd.DataFrame(data_type List, 
columns=['image path', 'dataset_type']), on='image path') 
data_df['category'] = data_df[l'category_ number'].apply(lambda x: 
category _List[int(x) - 1]) 

data_df = data_df.drop('category_ number', axis=1) 


-embeddings Sal | all mer (hs ob 5, pe OS] 
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train_image List ImageList.from_ df(df=data_df, path=root_ path, 
cols ).split_by_idxs( 
(data_df[data_df[ ] ].index), 


(data_df[data_df[ ] ].index)).label_from df(cols 
) 

test_image List = ImageList.from df(df-data_df[data_df[ 
], path=root_path, cols 


train_image list.transform(get_transforms(), 
).databunch(bs ).normalize(imagenet_ stats) 
.add_test(test_image list) 
.show_batch(rows=3, figsize=(8,8)) 


Skirt 


Dress 


Skirt 


de Jrardi PyTorch ; resnet Se plisul Hell JAN lil 2x p 35e ON 


phil Je 


# see models available: https://docs.fast.ai/vision.models.html 


# many options for Resnet, the numbers are the number of layers. 


# More layers are generally more accurate but take longer to train: 
resnet18, resnet34, resnet50, resnet101, resnet152 
# get top 1 and top 5 accuracy 

train _ model(data, pretrained_ model, model _metrics): 

Learner cnn_learner(data, pretrained_ model, 
metrics-model_metrics) 

learner.model torch.nn.DataParallel(learner.model) 

learner.lr_find() 

learner.recorder.plot(suggestion ) 

Learner 


pretrained_ model = models.resnet18 # simple model that can be trained 
on free tier 
# pretrained model = models.resnet50 # need pro tier, model I used 


model_metrics [accuracy, partial(top_k_accuracy, k=1), 
partial(top_k_accuracy, k=5)] 

Learner train _model(data, pretrained_ model, model _metrics) 
learner.fit_one_ cycle(10, max_lr ) 


HI JE pis pi Les ON 


interp = ClassificationInterpretation.from_ learner(learner) 
interp.plot_top_losses(9, largest h figsize-=( 
heatmap_thresh=5) 


Prediction/Actual/Loss/Probability 


Dress/Dress / 0.00 / 1.00 Skirt/Skirt / 0.00 / 1.00 Skirt/Skirt / 0.00 / 1.00 


Skirt/Skirt / 0.00 / 1.00 Skirt/Skirt / 0.00 / 1.00 Dress/Dress / 0.00 / 100 
” = 
\ 
’ À , . 
Le) 
Skirt/Skirt / O 00 / 1.00 Skirt/Skirt / 0.00 / 1.00 Dress/Dress / 0.00 / 1.00 
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# saving the model (temporary, will lose model once environment 


resets) 
Llearner.save('resnet-fashion') 


only sou Fast AI 48 ,L pascale ol 


SaveFeatures( ): 
features 

__init__(self, m): 

self.hook = m.register_forward_hook(self.hook_fn) 

self.features 

hook_fn(self, module, input, output): 

out = output.detach().cpu().numpy() 
isinstance(self.features, type( DE 
self.features out 


self.features = np.row_ stack((self.features, out)) 
remove(self): 
self.hook.remove() 


# load the trained model 
load learner(data, pretrained_ model, model _metrics, model _ path): 
Learner cnn_learner(data, pretrained_ model, 
metrics-model_metrics) 
learner.model torch.nn.DataParallel(learner.model) 
learner = learner.load(model_path) 
learner 


pretrained_ model = models.resnet18 # simple model that can be trained 
on free tier 
# pretrained model = models.resnet50 # need pro tier 


model_metrics [accuracy, partial(top_k_accuracy, k=1), 
partial(top_k_accuracy, k=5)] 

# if gdrive not mounted: 

drive.mount('/content/gdrive') 


model_path "/content/gdrive/My Drive/resnet18-fashion" 

# model_path = "/content/gdrive/My Drive/resnet50-fashion" 
Learner load_learner(data, pretrained model, model _ metrics, 
model_path) 


olee LD past OV cedlef 5) Sail &loall 1e Le gi 45 0, of (si 
LU Le 5 el +LASY nearest neighbours 


# takes time to populate the embeddings for each image 
# Get 2nd last Layer of the model that stores the embedding for the 


image representations 
# the Last linear layer is the output layer. 


<Ujiu äuagi ouai 


saved_features = SaveFeatures(learner.model.module[1][4]) 
_= learner.get_preds(data.train_ds) 
_= learner.get_preds(DatasetType.Valid) 


dpdésnell des qe CEST 51) ere 124 LI Lead bee bob psë dsl 
ghost object Sels is (35 ONE JS gete2sll rè EMPATE ME 
doll Lola nor) LIN al Jos 


el SU Lg le cf Le all US de ES 


# prepare the data for generating recommendations (exlcude test data) 
# get the embeddings from trained model 
img_path = [str(x) for x in (List(data.train_ds.items) 
+list(data.valid ds.items))] 
Label = [data.classes[x] for x in (list(data.train_ds.y.items) 
+list(data.valid_ ds.y.items))] 
Label_id = [x for x in (list(data.train_ds.y.items) 
+list(data.valid_ ds.y.items))] 
data_df_ouput = pd.DataFrame({'img path': img_ path, 'label': label, 
"Label_id': Label_id}) 
data_df_ouput['embeddings'] = 
np.array(saved_ features.features).tolist() 
# Using Spotify's Annoy 
def get_similar_images_ annoy(annoy_ tree, img_index, 
number_of_items=12): 
start = time.time() 
img_id, img_ Label = data _df_ouput.iloc[img_ index, [Q, 1]] 
similar_img_ids = annoy_tree.get_nns_by_item(img_index, 
number_of_items+1) 
end = time.time() 
print(f'{(end - start) * 1000} ms') 
# ignore first item as it is always target image 
return img_id, img Label, data_df_ouput.iloc[similar_img_ids[1:]] 


# for images similar to centroid 
def get_similar_images_ annoy_ centroid(annoy_ tree, vector _ value, 
number_of_items=12): 

start = time.time() 

similar_img_ids = annoy_tree.get_nns_ by _vector(vector_ value, 
number_of_items+1) 

end = time.time() 

print(f'{(end - start) x 1000} ms') 

# ignore first item as it is always target image 

return data_df_ouput.iloc[similar img _ids[1:]] 


def show_similar_images(similar_ images df, fig _size=[10,10], 
hide _labels-True): 
if hide labels: 
category_List = [] 
for i in range(len(similar_ images df)): 


# replace category with blank so it wont show in display 


category _list.append(CategoryList(similar_ images _df[ ].values 
x0, 
] x 
(similar_images df)).get(i)) 
else: 
category list = [learner.data.train_ ds.y.reconstruct(y}) for y 

in similar_images df[ 11] 

return learner.data.show_xys([open_image(img_id) for img_id in 
similar_images_df[ Ill, 

category list, figsize-fig size) 
# more tree = better approximation 

ntree = 100 
#"angular", "euclidean", "manhattan", "hamming", or "dot" 
metric_choïice = 


annoy_tree = AnnoyIndex( (data_df_ouputl[ 1[0]), 
metric-metric_ choice) 


# # takes a while to build the tree 
for i, vector in (data_df_ouput[ 
annoy_tree.add_item(i, vector) 
= annoy_tree.build(ntree) 


Gp Les A5 sa amet I pts digue Le 5 es 4235 Les ol 
s'shorts" 5 ei blu Gles e à 6 eue sl 


(outfit_embedding_ list): 

number_of_outfits = outfit_embedding list.shapel[0Q] 

length_of_embedding = outfit_embedding list.shapel1] 

centroid = [] 

for i in (Length_of_embedding): 

centroid.append(np. (outfit_embedding_ listl[:, 

i])/number_of_outfits) 

return centroid 
# shorts 
outfit_img_ids = [109938, 106385, 113703, 98666, 113467, 120667, 
20840, 8450, 142843, 238607, 124505,222671] 
outfit_embedding List = [] 
for img_index in outfit_img_ids: 

outfit_embedding_ list.append(data_df_ouput.iloc[img_index, 3]) 


outfit_embedding_ list = np.array(outfit_embedding_ List) 
outfit_centroid_embedding = centroid_embedding(outfit_embedding List) 
outfits_selected = data_df_ouput.iloc[outfit_img_ids] 


similar_images df = get_similar_ images annoy_centroid(annoy_tree, 
outfit_centroid_ embedding, 30) 


mail Doll" lue 5 (se dsl 
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Named Entity (NER]) como qUA Qlc ôycil (15 
Recognition 

«sf sf ol of at he Cas EL Bo : À si 4æ Named Entity semall HLSII 
lazell oje "UNI les co y Us er : JE) de de cpu 4 pe «gi si dan 
US" D nel Lgalls NI el" 25 sol 3f Jets 'obdl" sul Le Giles 

slomell SULSI Le © xl deg SN el GONER) êlemoll SULSI Le ol 
OA Ge Gore 857 Délemoll LS Aer) Ladel ll axiles Loge (NER) 


SU, clngll Lez ske Grammarly el Se mb, plat EÙ Ge Ja 
EN célomall ELLSI pe LS Jai Ÿ LES gone Dell Gimomall L pe ll 
OUEN Je Dell Loge JS HT Gé Al of Gal Lula poses 

NI bell plié, (NER) élomal 


(NER) 2220)! QUAI Le «6 Je UU à LUI, JLon5 

#gà Jai ON La se Dogue Lagall lg lpotisule I OUI io some (55 GR 
LS pus Les EU .& Jus Le 4 nl Lee 8,85 el, LUI Las 32 alle 
“SEUI (Lesss ; pandas 


google.colab files 
uploaded = files.upload() 
pandas pd 


data pd.read_csv( , encoding 
data.head() 


Sentence # Word POS Tag 


0 Sentence: 1 Thousands NNS O 


1 NaN of IN O 
2 NaN demonstrators NNS O 
3 NaN have VBP O 
4 NaN marched VBN O 


(NER) couoJl AI QLe «8 pr 


Li 5 300 Janus eX Lee Jos test I Louis LS of (5 3 D Ses «LUI 
Y Loul ed 5 

an lé GÜUUI pui 

Le op EUX. (NER) sal DES Le &5 xl dage le Las 4505 Dante 8 
Dogue Doles of Re os de al og Leslie Y SU le ll 2x ele I 
aa NI ou Lolleall lat ét 3 5 os sl iiyloe ASS pue 


from itertools import chain 

def get_dict_map(data, token_or_ tag): 
tok2idx = {} 
idx2tok = {} 


if token_or_ tag == ‘token': 

vocab = list(set(datal'Word'].to_list())) 
else: 

vocab = list(set(datal['Tag'].to_list())) 


idx2tok = {idx:tok for ïidx, tok in enumerate(vocab)} 
tok2idx = {tok:idx for ïidx, tok in enumerate(vocab)}} 
return tok2idx, idx2tok 
token2idx, idx2token = get_dict_map(data, 'token') 
tag2idx, idx2tag = get _dict map(data, 'tag') 


nan ER nat QUI 7 Y SU êtes VI box DVI 8 tu 


datal['Word_idx'] = datal['Word'].map(token2idx) 

data['Tag_idx'] = datal['Tag'].map(tag2idx) 

data_fillna = data.fillna(method='ffill', axis=0) 

# Groupby and collect columns 

data_group = data_fillna.groupby( 

['Sentence #'],as_index=False 

)L'Word', "POS', 'Tag', 'Word_idx', 'Tag_idx'].agg(lambda x: list(x)) 
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from sklearn.model_ selection import train test _ split 
from keras.preprocessing.sequence import pad_sequences 
from keras.utils import to_categorical 


def get_pad_train_test_val(data_ group, data): 


#Hget max token and tag Length 
n_token = Llen(list(set(datal['Word'].to_list()))) 
n_tag = Len(list(set(datal'Tag'].to list()))) 


#Pad tokens (X var) 


tokens = data_groupl ].tolist() 

maxlen ([Len(s) for s in tokens]) 

pad_tokens = pad_sequences(tokens, maxlen=maxlen, dtype= 
padding= , Value= n_token - 1) 


#Pad Tags (y var) and convert it into one hot encoding 

tags = data_groupl ].tolist() 

pad_tags = pad_sequences(tags, maxlen=maxlen, dtype-= 
padding= , Value= tag2idx[ 1) 

n_tags = (tag2idx) 

pad_tags = [to_categorical(i, num classes=n_ tags) for i in 
pad_tags] 


#Split train, test and validation set 

tokens_, test_tokens, tags _, test_tags - 
train test _split(pad_tokens, pad_ tags, test_size-0.1, train size=0.9, 
random_state=2020) 

train_tokens, val_tokens, train _ tags, val_ tags = 
train _test_split(tokens_ ,tags_ ,test_size = Q.25,train size =0.75, 
random_state=2020) 


print( 
(train_tokens), 
: (train_tokens), 
: (test_tokens), 
(test_tags), 
(val_tokens), 
(val_tags), 
) 


return train _ tokens, val_tokens, test_tokens, train _ tags, 
val_tags, test _tags 


train_tokens, val_tokens, test_tokens, train tags, val _ tags, test_tags 
= get_pad_ train _test_val(data group, data) 


train_tokens length: 32372 
train_tokens length: 32372 
test_tokens length: 4796 
test_tags: 4796 
val_tokens: 10791 val_tags: 10791 


Let A pool raz 91 out QUE Les pas] Lau SCT LS Cu 06 Ge le «0VI 
aa Lau uns) 
import numpy as np 


import tensorflow 
from tensorflow.keras import Sequential, Model, Input 


from tensorflow.keras.layers import LSTM, Embedding, Dense, 
TimeDistributed, Dropout, Bidirectional 
from tensorflow.keras.utils import plot_ model 
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from numpy.random import seed 
seed(1) 
tensorflow.random.set_seed(2) 


Gal ls Je al bxus LSTM D ce sel bof 2 jt 
input_dim = len(list(set(datal'Word'].to_Llist())))+1 
output_dim = 64 


input _ Length = max([len(s) for s in data_groupl'Word_ idx'].tolist()]) 
n_tags = Len(tag2idx) 
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def get_bilstm_lstm_model(): 
model = Sequential() 


# Add Embedding layer 
model.add(Embedding(input_dim=input dim, output _dim=output_dim, 
input_Length=input_length)) 


# Add bidirectional LSTM 
model.add(Bidirectional(LSTM(units=output_dim, 
return _sequences=True, dropout=0.2, recurrent_dropout=0.2), merge_ mode 
= iece)) 


# Add LSTM 
model.add(LSTM(units=output_dim, return _sequences=True, 
dropout=0.5, recurrent_dropout=0.5)) 


# Add timeDistributed Layer 
model.add(TimeDistributed(Dense(n_ tags, activation="relu"))) 


#Optimiser 
# adam = k.optimizers.Adam(lr=0.0005, beta_1=0.9, beta _2=0.999) 


# Compile model 

model.compile(loss='categorical crossentropy', optimizer='adam', 
metrics=['accuracy']) 

model.summary() 


return model 
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def train model(X, y, model ): 
loss = List() 
for i in range(25): 


# fit model for one epoch on this sequence 
hist = model.fit(X, y, batch_size-1000, verbose=1, epochs=1, 
validation _split=0.2) 


loss.append(hist.history[ 


Loss 


:294 jôl Ja)l 
results = pd.DataFrame() 
model_bilstm_lstm = get_bilstm_lstm_ model() 
plot_model(model_bilstm_lstm) 
results[ ] 


train model(train_tokens, 
np.array(train_tags), model _bilstm_Lstm) 
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spacy 
spacy displacy 


nlp = spacy.load( 
text = nlp( 


displacy.render(text, style , jupyter 


Hi, My name is MAmanKharwalm PERSON 


| want to work with Google oRG 


Steve-JObS“ PERSON“ is My Inspiration 
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import collections 

import helper 

import numpy as np 

import project _tests as tests 

from keras.preprocessing.text import Tokenizer 

from keras.preprocessing.sequence import pad_sequences 


from keras.models import Model 

from keras.layers import GRU, Input, Dense, TimeDistributed, 
Activation, RepeatVector, Bidirectional 

from keras.layers.embeddings import Embedding 

from keras.optimizers import Adam 

from keras.losses import sparse categorical_ crossentropy 
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import os 
def load data(path): 


Load dataset 

input_file = os.path.join(path) 

with open(input_file, "r") as f: 
data = f.read() 


return data.split('\n') 
def _test_model(model, input_shape, output sequence Length, 
french_vocab_ size): 
if isinstance(model, Sequential): 
model = model.model 


assert model.input_shape == (None, xinput_shapel[1:]),\ 
‘Wrong input shape. Found input shape {} using parameter 
input_shape={}'.format(model.input_ shape, input_shape) 


assert model.output_shape == (None, output sequence Length, 
french_vocab_size),\ 
Wrong output shape. Found output shape {} using parameters 
output _sequence_Length={} and french_vocab size={}'\ 
.format(model.output_shape, output _ sequence Length, 
french_vocab_ size) 
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assert Len(model.loss_ functions) 5amp;gt; 0,\ 
"No loss function set. Apply the compile function to the 


model." 


assert sparse categorical_crossentropy in model.loss_ functions, \ 
"Not using sparse categorical crossentropy function for Loss. 
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english_sentences = helper.load_ data('data/small_vocab_en') 
french_sentences = helper.load_data('data/small_vocab_ fr') 


print('Dataset Loaded) 


UE 2e gogms Jess pi 


for sample_i in range(2): 
print('small_vocab_en Line {}: {}'.format(sample_i + 1, 
english_sentences[sample_il)) 


print('small_vocab fr Line {}: {}'.format(sample_i 
french_sentences[sample_i])) 


small vocab_en Line 1: new jersey is sometimes quiet during autumn , and it is snowy in april . 
small vocab fr Line 1: new jersey est parfois calme pendant l'automne , et il est neigeux en avril. 
small vocab_en Line 2: the united states is usually chilly during july , and it is usually freezing in 
november . small vocab fr Line 2: les états-unis est généralement froid en juillet , et il gèle 
habituellement en novembre. 
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english_words_ counter = collections.Counter([word for sentence in 

english_sentences for word in sentence.split()]) 

french_words_ counter = collections.Counter([word for sentence in 

french_sentences for word in sentence.split()]) 

print('{} English words.'.format(len([word for sentence in 

english_sentences for word in sentence.split()]))) 

print('{} unique English words.'.format(len(english words _ counter))) 

print('10 Most common words in the English dataset:') 

DTA E CMS 
.join(list(zip(xenglish_ words counter.most_common(10)))[@]) + '"') 

print() 

print('{} French words.'.format(len([word for sentence in 

french_sentences for word in sentence.split()]))) 

print('{} unique French words.'.format(len(french words _counter))) 

print('10 Most common words in the French dataset:') 

Dire (CR + 

"',join(list(zip(xfrench_ words counter.most_common(1@)))[@]) + '"') 


1823250 English words. 
227 unique English words. 
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10 Most common words in the English dataset: 


"js" us "in" "it" "during" "the" "but" "and" "sometimes" 


1961295 French words. 

355 unique French words. 

10 Most common words in the French dataset: 
"est en" "il" "les" "mais" "et" "la" "parfois" 


CORTE TE un 
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def tokenize(x): 
x_tk = Tokenizer(char_ level = False) 
x_tk.fit_on_texts(x) 
return x_tk.texts_ to sequences(x), x_tk 
text_sentences = [ 


"The quick brown fox jumps over the Lazy dog .', 
"By Jove , my quick study of Lexicography won a prize . 
‘This is a short sentence .'] 


text_tokenized, text_tokenizer = tokenize(text_sentences) 
print(text_tokenizer.word_index) 
print() 
for sample_i, (sent, token_ sent) in enumerate(zip(text_sentences, 
text_tokenized)): 

print('Sequence {} in x'.format(sample_i + 1)) 

print(' Input: {}'.format(sent)) 

print(' Output: {}'.format(token_sent)) 


{'the’: 1, quick": 2, 'a": 3, brown’: 4, ‘fox': 5, ‘jumps': 6, 'over': 7, ‘lazy': 8, 'dog': 9, by": 10, ‘jove': 11, 
‘my’: 12, ‘study': 13, ‘of: 14, ‘lexicography": 15, ‘won': 16, 'prize": 17, ‘this’: 18, ‘is': 19, short": 20, 
‘sentence': 21} 


Sequence 1 in x 
Input: The quick brown fox jumps over the lazy dog. 
Output: [1, 2, 4, 5, 6, 7, 1, 8,9] 

Sequence 2 in x 
Input: By Jove , my quick study of lexicography won a prize. 
Output: [10, 11, 12, 2, 13, 14, 15, 16, 3,17] 

Sequence 3 in x 
Input: This is a short sentence. 


Output: [18, 19, 3, 20, 21] : 
sgh nds UN JS Jens phonll 4 Le paires OÙ Les 


def pad(x, Length=None): 
if Length is None: 
length = max([len(sentence) for sentence in x]) 
return pad_sequences(x, maxlen = length, padding = 'post') 
tests.test_pad(pad) 


# Pad Tokenized output 
test_pad = pad(text_tokenized) 
for sample_i, (token_ sent, pad_ sent) in enumerate(zip(text_tokenized, 
test_pad)): 
print('Sequence {} in x'.format(sample_i + 1)) 


101 U9ÈU pla iul GLocil roll (6 Leg jus 20 


print(' Input: {}'.format(np.array(token_ sent))) 
print(' Output: {}'.format(pad_sent)) 

Sequence 1 in x 

Input: [124567189] 

Output: [1245671890] 

Sequence 2 in x 

Input: [10 11 12 2 13 1415 16 317] 

Output: [10 11 12 213 14 15 16 317] 

Sequence 3 in x 

Input: [1819 32021] 

Output: [1819 3202100000] 
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def preprocess(x, y): 
preprocess x, x_tk = tokenize(x) 
preprocess_y, y_tk = tokenize(y) 
preprocess_ x = pad(preprocess_x) 
preprocess_y = pad(preprocess_y) 
# Keras's sparse categorical_crossentropy function requires the labels 
to be in 3 dimensions 
preprocess_y = preprocess_y.reshape(xpreprocess_y.shape, 1) 
return preprocess_x, preprocess_y, x_tk, y_tk 
preproc_english sentences, preproc_french sentences, 
english_tokenizer, french tokenizer =\ 
preprocess(english_sentences, french _sentences) 


max_english_ sequence length = preproc_english_sentences.shapel[1] 
max_french_ sequence Length = preproc_french_sentences.shapel[1] 
english_vocab_ size = len(english_ tokenizer.word_index) 

french _vocab_ size = Len(french_ tokenizer.word_index) 

print('Data Preprocessed') 

print("Max English sentence Length:", max_english_ sequence length) 
print("Max French sentence Length:", max_french_ sequence length) 
print("English vocabulary size:", english vocab_ size) 
print("French vocabulary size:", french vocab_ size) 


Data Preprocessed 
Max English sentence length: 15 
Max French sentence length: 21 
English vocabulary size: 199 
French vocabulary size: 344 
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def logits_ to text(logits, tokenizer): 
index_to_ words = {id: word for word, id in 

tokenizer.word_index.items()} 
index_to_words[0] = ‘&amp;lt;PAD&amp;gt;" 


return ",join([index_to_ words[prediction] for prediction in 
np.argmax(logits, 1)]) 
print(' logits_ to _ text function loaded.') 


sé os it eigo CE 
‘logits_to_text' function loaded. 
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def (input_shape, output_sequence_ length, 
english_vocab_ size, french _ vocab_ size): 
Learning_rate = 1e-3 
input_seq = Input(input_shapel[1:]) 
rnn = GRU(64, return_sequences = True)(input_seq) 
logits = TimeDistributed(Dense(french_ vocab_size))(rnn) 
model = Model(input_seq, Activation( )(Logits)) 
model. (loss = sparse categorical _crossentropy, 
optimizer = Adam(learning_ rate), 


metrics = i 1) 


return model 
tests.test_ simple model(simple_ model) 
tmp_x pad(preproc_english_sentences, max_french sequence length) 
tmp_x = tmp_x.reshape((-1, preproc_ french sentences.shapel[-2], 1)) 
# Train the neural network 
simple_rnn model = simple_model( 

tmp_x.shape, 

max_french_ sequence _ length, 

english_vocab_ size, 

french_vocab_ size) 
simple _rnn model.fit(tmp_x, preproc_french_ sentences, batch_size=1024, 
epochs=10, validation _split=0.2) 
# Print prediction(s) 
print(logits_ to _text(simple_ rnn_ model.predict(tmp_x[:1])[01], 
french_tokenizer)) 


Train on 110288 samples, validate on 27573 samples 
Epoch 1/10 
110288/110288 


mm 


- 9s 82us/step - loss: 3.5162 - acc: 0.4927 - val_loss: 


nan - val_ acc: 
Epoch 2/18 

110288/110288 [==============================] - 7s 6dus/step - loss: 2.4823 - acc: 0.4655 - val_loss 
nan - val acc: 0.4838 
Epoch 3/18 


110288/110288 - 7s 63us/step - loss: 2.2427 - acc 


© 


.5016 - val_loss: 


nan - val_acc: 
Epoch 4/10 
110288/110288 [==============================] - 7s 64us/step - loss: 


nan - val_acc: 0.5428 


1 


.0188 - acc: 0.5230 - val _ loss: 


Epoch 5/10 


110288/110288 7s 64us/step - loss: 1.8418 - acc: 0.5542 - val_ loss 
nan val_acc 

Epoch 6/18 

110288/110288 [==============================] - 7s 6d4us/step - loss: 1.7258 - acc: 0.5731 - val_ loss: 


nan - val_acc: 0.5811 
Epoch 7/18 
110288/110288 


- 7s 64us/step - loss: 1.6478 - acc: 0.5871 - val_loss: 


Epoch 8/18 

110288/110288 [==============================] - 7s 64us/step - loss: 1.5850 - acc: @.5949 - val_loss: 
nan - val_ acc: 0.5977 

Epoch 9/10 

110288/110288 \[=================-----] - 7s 6Aus/step - loss: 15320 - acc: 6.5996 - val loss: 
nan - val_acc: 0.6027 

Epoch 18/10 

110288/110288 [==============================] - 7s 64us/step - loss: 1.4874 - acc: 0.6037 - val_loss: 
nan - val_acc: @.6039 

new jersey est parfois parfois en en et il est est en en <PAD> <PAD> <PAD> <PAD> <PAD> <PAD> <PAD> 
<PAD> 
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from keras.models import Sequential 
def (input_shape, output_sequence length, 
english_vocab_ size, french _ vocab size): 
Llearning_rate = 1e-3 
rnn = GRU(64, return _sequences=True, activation= 


embedding = Embedding(french_vocab_ size, 64, 
input_Llength=input_shapel[1]) 

logits = TimeDistributed(Dense(french_ vocab size, 
activation= )) 


model = Sequential() 

#em can only be used ïin first layer --&5amp;gt; Keras Documentation 
model.add(embedding) 

model.add(rnn) 

model.add(logits) 

model. (loss=sparse categorical_crossentropy, 


optimizer=Adam(learning_ rate), 
metrics=[ 1) 


return model 
.test_embed_ model(embed_model) 
pad(preproc_english_sentences, max_french sequence length) 
tmp_x.reshape((-1, preproc_french_ sentences.shape[-2])) 
embeded_ model = embed_model( 
tmp_x.shape, 
max_french_ sequence length, 
english_vocab_ size, 
french_vocab_ size) 
embeded_ model.fit(tmp_x, preproc_ french sentences, batch _size=1024, 
epochs=10, validation _split-=0.2) 
print(logits_to_text(embeded_ model.predict(tmp_x[:1])[0], 
french_tokenizer)) 
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Train on 110288 samples, validate on 27573 samples 
Epoch 1/18 
110288/110288 [== 
nan - val_acc: 0.4093 
Epoch 2/10 
110288/110288 
nan - val_acc: 
Epoch 3/10 
110288/110288 
nan - val_ acc: 
Epoch 4/10 
110288/110288 
nan - val_acc: 
Epoch 5/10 
110288/110288 
nan - val_acc: 
Epoch 6/10 
110288/110288 
nan - val_acc: 
Epoch 7/10 
110288/110288 
nan - val_acc: 
Epoch 8/10 
110288/110288 
nan - val_acc: 
Epoch 9/10 
110288/110288 
nan - val acc: 0.8313 
Epoch 19/10 
110288/110288 [== 
nan - val_acc: 0.8401 


- 8s 68us/step - loss: 3.7877 - acc: 0.4018 - val_loss: 


- 7s 65us/step - loss: 2.7258 - acc: 0.4382 - val_ loss: 


- 7s 65us/step - loss: 2.0359 - acc: 0.5453 - val _ loss: 


- 7s 65us/step - loss: 1.4586 - acc: 0.6558 - val_loss: 


- 7s 65us/step - loss: 1.1346 - acc: 0.7308 - val_loss: 


- 7s 65us/step - loss: 0.9358 - acc: 0@.7681 - val_ loss: 


- 7s 65us/step - loss: @.8057 - acc: @.7917 - val_loss: 


- 7s 65us/step - loss: @.7132 - acc: @.8095 - val_ loss: 


- 7s 65us/step - loss: 0.6453 - acc: 0.8229 - Val_loss: 


- 7s 64us/step - loss: 0@.5893 - acc: @.8355 - val_ loss: 


new jersey est parfois calme au l'automne et il il est neigeux en en <PAD> <PAD> <PAD> <PAD> <PAD> 
<PAD> <PAD> 
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import time 

import cv2 

import os 

import random 

import numpy as np 

import matplotlib.pyplot as plt 
from PIL import Image 

import imutils 

import matplotlib.image as mpimg 
from collections import OrderedDict 
from skimage import io, transform 
from math import *x 

import xml.etree.ElementTree as ET 


import torch 

import torchvision 

import torch.nn as nn 

import torch.optim as optim 

import torch.nn.functional as F 

import torchvision.transforms.functional as TF 

from torchvision import datasets, models, transforms 
from torch.utils.data import Dataset 

from torch.utils.data import DataLoader 
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**capture 

if not 

os.path. ( DE 
lwget 


http://dlib.net/files/data/ibug 300W Large face Llandmark_dataset.tar.g 
z 


ltar -xvzf 
{rm -r 
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file 

open( 'ibug_300W large _face_landmark_dataset/helen/trainset/10003254Q 1 
DES) 

points = file.readlines()[3:-1] 


Landmarks [] 


point points: 
X,y = point.split(' ') 
Landmarks.append([floor(float(x)), floor(float(y[:-1]))]) 


Landmarks = np.array(Landmarks) 


plt.figure(figsize=(10,10)) 
plt.imshow(mpimg.imread('ibug_ 300W large face _landmark_dataset/helen/t 
rainset/100032540 1.jpg')) 

plt.scatter(landmarks[:,0], Llandmarks[:,1], s 

plt.show() 
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class Transforms( ): 
def (self): 
pass 


(self, image, landmarks, angle): 
angle = random.uniform(-angle, +angle) 


transformation matrix = torch.tensor([ 
[+cos(radians(angle)), -sin(radians(angle))], 
[+sin(radians(angle)), +cos(radians(angle))] 


1) 
image = imutils.rotate(np.array(image), angle) 


Landmarks = Landmarks - @.5 

new_landmarks = np.matmul(landmarks, transformation matrix) 
new_landmarks = new_lLandmarks + Q.5 

return Image.fromarray(image), new_landmarks 


(self, image, landmarks, img_size): 
image = TF.resize(image, img_size) 
return image, landmarks 


(self, image, Llandmarks): 
color_jitter = transforms.ColorJitter(brightness-0.3, 
contrast=0.3, 
saturation=0.3, 
hue=0.1) 


image = color_jitter(image) 
return image, landmarks 


(self, image, Llandmarks, crops): 
left = (crops[ 1) 
top = (crops[ 1) 
width = (crops[ 1) 
height (crops[ 1) 


image = TF.crop(image, top, left, height, width) 


img_shape = np.array(image).shape 

Landmarks = torch.tensor(landmarks) - torch.tensor([[left, 
top]1) 

Landmarks = landmarks / torch.tensor([img_shapel1], 
img_shape[0]]) 

return image, landmarks 


def (self, image, Llandmarks, crops): 

image = Image.fromarray(image) 

image, Landmarks self.crop_face(image, landmarks, crops) 
image, Landmarks = self.resize(image, Landmarks, (224, 224)) 
image, Landmarks = self.color_jitter(image, landmarks) 
image, landmarks self.rotate(image, landmarks, angle=10) 


image = TF.to_tensor(image) 
image = TF.normalize(image, [0.5], [@.5]) 
return image, landmarks 


class FaceLandmarksDataset(Dataset ): 
def (self, transform-None): 


tree 
ET.parse( 
) 


root = tree.getroot() 


self.image_filenames = [] 
self.landmarks = [] 
self.crops = [] 
self.transform = transform 
self.root_dir = 


for filename in root[2]: 
self.image filenames.append(os.path.join(self.root_dir, 
filename.attrib[ 1)) 


self.crops.append(filename[@].attrib) 


Llandmark = [] 

for num in (68): 
x_coordinate = (filename[@][num].attribl 1) 
y_coordinate = (filename[@][numl.attribl'y']) 
landmark.append([x_coordinate, y_coordinate]) 

self.landmarks.append(landmark) 


self.landmarks = np.array(self.landmarks).astype( 


assert (self.image_filenames) == (self.landmarks) 


(self): 
return (self.image_filenames) 


(self, index): 
image = cv2.imread(self.image filenames[index], 0) 


Landmarks = self.landmarks[indexl] 


if self.transform: 
image, Llandmarks = self.transform(image, Landmarks, 


self.crops[index]) 


Landmarks = Llandmarks - Q.5 


return image, landmarks 


dataset = FaceLandmarksDataset(Transforms()) 


De di SULI Lo sage pou pu UNI > à Les Le Le age 5,15 Ab Les OS 
HU de pagua) oof 8, Sol ol 0 5 pu Sd Lol el ,>) JA 


ge dire QU, Le soges 25 Le page I OLLUI Be some pile Le aol) NI 
4.2) 


# split the dataset into validation and test sets 
Llen_valid_ set - (@.1xlen(dataset)) 


len_train_ set = (dataset) - Len_ valid set 


aagJl roJlco cuï14 


print( : (Len_train_set)) 
print( : (Llen_valid_set)) 


train _dataset , valid dataset, = 
torch.utils.data.random split(dataset , [len_ train set, 
len_valid_set]) 


# shuffle and batch the datasets 

train_loader = torch.utils.data.DatalLoader(train dataset, 
batch_size-64, shuffle-True, num workers=4) 

valid loader = torch.utils.data.DataLoader(valid_ dataset, batch size=8, 
shuffle-True, num_workers=4) 


The length of Train set is 6000 
The Length of Valid set is 666 


images, Landmarks (train_loader)) 


print(images.shape) 
print(landmarks.shape) 


torch.Size([64, 1, 224, 224]) 
torch.Size([64, 68, 2]) 


PD RARE ps Lu al Je LS ResNet18 pile oi 
Los pe Use SEA La es As 


class Network(nn.Module): 
def (self,num_classes-136): 
Co -rmite () 

self.model_name-= 

self.model=models.resnet18() 

self.model.convi=nn.Conv2d(1, 64, kernel _size=7, stride=2, 
padding=3, bias=False) 

self.model.fc=nn.Linear(self.model.fc.in_ features, 
num_classes) 


def (self, x): 


x=self.model(x) 
return x 


import sys 


def (step, total step, loss, operation): 
sys.stdout.write( ) 
if operation == 
sys.stdout.write( 
total_ step, loss)) 
else: 


U9ÜU plais GLocii roll (6 Leg jus 20 


sys.stdout.write("Valid Steps: %d/%d Loss: %.4f " % (step, 
total_ step, loss)) 


sys.stdout.flush() 


angl leo SUiAU ani à Auiul 4 Ja 
HAa8 golls Raoul ag gl ls one dau srall as 5 Ua GI pacte 


torch.autograd.set_detect_anomaly(True) 
network = Network() 
network.cuda() 


criterion = nn.MSELoss() 
optimizer = optim.Adam(network.parameters(), Lr-0.0001) 


loss min = np.inf 
num_epochs = 10 


start_time = time.time() 
for epoch in range(1,num_epochs+1): 


Loss_traïin = 


Q 
Loss_valid = Q@ 


running_loss = Q 


network.train() 
for step in range(1,len(train Loader)+1): 


images, Landmarks = next(iter(train_loader)) 


images = images.cuda() 
Landmarks = landmarks.view(landmarks.size(@),-1).cuda() 


predictions = network(images) 


# clear all the gradients before calculating them 
optimizer.zero_ grad() 


# find the loss for the current step 
loss_train_step = criterion(predictions, landmarks) 


# calculate the gradients 
loss_train_step.backward() 


# update the parameters 
optimizer.step() 


loss_train += loss train_step.item() 
running_loss = loss _train/step 


print_overwrite(step, len(train_loader), running_ loss, 
'train') 


network.eval() 


angl rollco cuir 4 


with torch.no_grad(): 
for step in range(1,len(valid Loader)+1): 
images, Landmarks = next(iter(valid_ loader)) 


images = images.cuda() 
Landmarks = landmarks.view(landmarks.size(@),-1).cuda() 


predictions = network(images) 


# find the loss for the current step 
loss_valid_ step = criterion(predictions, Landmarks) 


loss_valid += loss valid _step.item() 
running_loss = loss _valid/step 


print_overwrite(step, len(valid_ loader), running_ loss, 
'valid') 


loss_train /= len(train _Loader) 
Lloss_valid /= len(valid_ Loader) 


print(tEpoch: {Train loss: 4F}. Valid loss: 
{:.4f}'.format(epoch, Loss_ train, loss _valid)) 


if loss valid < loss min: 

Loss_min = Loss_ valid 

torch.save(network.state dict(), 
"/content/face landmarks.pth') 

print("\nMinimum Validation Loss of {:.4f} at epoch 
{}/{}".format(loss_ min, epoch, num_epochs)) 

print('Model Saved\n') 
print('Training Complete) 
print("Total Elapsed Time : {} s".format(time.time()-start_time)) 


angl rilco &sgi 
Le pores Ge lgdalie Lu à 2 all Le eDhel ob y (GI 3 sal pures ON Le 
LUI 


start_time = time.time() 
with torch.no_grad(): 


best_network = Network() 
best_network.cuda() 


best_network.load state dict(torch.load('/content/face landmarks.pth') 
) 


best_network.eval() 


images, Landmarks = next(iter(valid_ loader)) 


images = images.cuda() 
Landmarks = (landmarks ) 


predictions = (best_network(images).cpu() 
predictions = predictions.view(-1,68,2) 


plt.figure(figsize=(10,40)) 


img_num range(8): 
plt.subplot(8,1,img_num+1) 


plt.imshow(images[img_num].cpu().numpy().transpose(1,2,0).squeeze(), 
cmap='gray') 

plt.scatter(predictions{img_ num,:,0], 
predictions[img_num,:, ; ) 


(‘Total number of test images: {}'.format(len(valid_ dataset))) 


end_time = time.time() 
("Elapsed Time : {}".format(end_ time - start _time)) 
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Dog and Cat CNN plais hhüllg UUAJ ouai (18 
Classification using CNN 

CNN Jlüoaäo 

as nes jf pal ire ul JR (CNN) RG Lex RU pas à 
Les LGV Galésil 

SJ 45185 )l A2 plè rm es ko Ze 8)50 La] IN Las SUN 
RGB Go ,> JS els EL C6 Lise s coll EU Jb Le MSI 


SEM Je Gles Abe Lu Le Ga du RAGE IR Le pol 4,25 Loto 
sal Lex 


square Le yep) LAS N RU AE 5 JS Gal LS de LS ce Mu La 05 
filter :B ,< La, ,s5s SSI + patches 


patch äx5 Ji Lame (55luus cleuds 3) sell je pol dus 36 sèoe Lai se Elu Li 
kernel Ji, Cal eus s 
lié oJi al pub D fau 
import numpy as np # linear algebra 
import pandas as pd # data processing, CSV file 1/0 (e.g. pd.read_ csv) 
import cv2 


import matplotlib.pyplot as plt 


import tensorflow as tf 
from tensorflow.keras.models import Sequential 


from tensorflow.keras.layers import Dense, Flatten, Dropout, Activation, 
Conv2D 


JE SU p 525 cobiof 2,01 Gest ob ne Gien of 5 ge un Jp lis 


. 3Xe)| plu 8)eil cl pbs pod Qi ce Al ème io 3æ soil ; ail nez 
Je Jar cpl de 5 5 MOI mL Le 3 «cat.2s .dog. 15 .dog.0 Fe Asa Lu 


IS JE ie ed pq AS Jill lis | Jane) .83 mal 45 LS" cat" 5 +” dog" Je 
"0" ai de Lis "1" ai de 


SL Got 5 eS jle Ja RSS SI 1e de sages nl dl Ga 550 JS OI 


gr ol 5,405 fu Las, Bièes QUE, Le Gel,à cv2 &Se La pure 
4s5b) 
train_dir = # your path to train dataset 


path = os.path.join(main_ dir,train dir) 


for p in os.listdir(path): 
category = p.split[@](".") 
img_array = cv2.imread(os.path.join(path,p),cv2.IMREAD_ GRAYSCALE) 
new_img_array = cv2.resize(img_ array, dsize=(80, 80)) 
plt.imshow(new_img_array,cmap="gray") 


break 
La lue Il pe Aou ejzall CI Je ci pe DVI. 2 all gd 1 clef je Ji DS us 
QE pe X Da Lun y Bigoll 4 tools X EL Lol Lu juil Le eme ce cel 


ls Cest des US SI 8 LADU 1 5À 0 LI a y DKcus JR Che y 40 de gage 
0, 1 g' cat" À" dog' ui lle (I Los 


ge he game | Jos le Ge juil 5 JS i-b create test data ls LL 5 
Ed Les LOL 435 .X de ço2e dy BLEU 3 -80 x 80 15,5 pr Lex pi 
-Y & géo ail 
X[1 = 
yll = 
convert = lambda category: int(category == 'dog') 
def create _ test _data(path): 
for p in os.listdir(path): 
category = p.split[@](".") 
category = convert(category) 


img_array = 
cv2.imread(os.path.join(path,p),cv2.IMREAD GRAYSCALE) 


new_img_array = cv2.resize(img_ array, dsize=(80, 80)) 
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X.append(new_img_array) 
y.append(category) 
55e] LA Le, enumpy &yèee Ji ps X Jo Ge Y Caf s Il eut ol 
sbsl 3 SI X LES 
create _test_data(path) 
X = np.array(X).reshape(80,80,1 ,1-) 
y = np.array(y) 
4 JS JS EN EU 2550 pe poil ge ego eo gogue 45) ES X perd af 15 
Crnall 5e pures cpl os ES Re gas gs pe ON cpl al ce Lili BUS 
(Éd) pl col is Le 
ES SLI normalize &5 ékSe coms JR Es cues 45, lis (es as 
2550 5e pole Ge Le pl pra Of ee pl LS Lila Keras does plaiul 
1-0 5 L bé pl oz Ge J'nalls 255 Le louis Lis Les 1) 
JL JS Ge NU ds GA ac) 5 lei ode dass CLS .ebof olilaë Le ia 
| db Bo 
#Normalize data 
X = X/255.0 
&igoil Qu Jai 
model = Sequential() 
#Adds a densely-connected layer with 64 units to the model: 


model.add(Conv2D(64,(3,3), activation = "relu', input_shape = 
X.shapel[1:])) 


model.add(MaxPooling2D(pool_size = (2,2))) 
#Add another: 
model.add(Conv2D(64,(3,3), activation = 'relu')) 


model.add(MaxPooling2D(pool_size = (2,2))) 


model.add(Flatten()) 


model.add(Dense(64, activation='relu')) 


#Add a softmax layer with 10 output units: 


1 Pr 
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model.add(Dense(1, activation='sigmoid')) 


model.compile(optimizer="adam," 


loss="binary_crossentropy,' 


metrics=['accuracy']( 


oi ŒULS pe Le ge MEDG qu DVI 


SL Less vas D 8e S:Epochs cl 5] 


Bas Lo poele Lane 5 all LUI je :Batch size Lui ea 


Das pe Ga arte U (220 Ubu ox 3) SLI liée Le : Validation split 
doll se Gil 


validation _split=0.2) 


model.fit(X, 


V, epochs=10, batch_size=32, 


1 Train on 20008 samples, validate on 5088 samples 


2 Epoch 1/18 


3 20000/20000 ! 


4 Epoch 2/10 

5 20000/20000 
6 Epoch 3/10 

7 20000/20808 
8 Epoch 4/10 

9 20000/20008 
18 Epoch 5/10 
11 20000/20000 
12 Epoch 6/10 

13 20000/20000 
14 Epoch 7/10 

15 20000/20000 
16 Epoch 8/10 

17 20000/20000 
18 Epoch 9/10 

19 20008/200080 
26 Epoch 10/10 
21 20000/200080 


165 


14s 


14s 


14s 


14s 


14s 


14s 


14s 


14s 


14s 


79@us/step 


679us/step 


679us/step 


68Qus/step 


679us/step 


679us/step 


68Qus/step 


681us/step 


679us/step 


68Qus/step 


loss: 


loss: 


loss: 


loss: 


loss: 


loss: 


loss: 


loss: 


loss: 


loss: 


-61€ 


.498 


.45€ 


.405 


.367 


.318 


.27€ 


-215 


.164 


122 


SU bob ENN pis bles 5 1 si PREDICT gui oi 5 4 ol 
EN La 
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predictions = model.predict(X_ test) 
Àe sos gd Y r He Lez, sigmoid dis Lost és La del DE 
LYS 45 ll LULU 
predicted_val = [int(round(p[@]1)) for p in predictions] 
Eh LOU SLI Le page Jus Y Jus Y SU, JU] (Less of élue oNI 
submission_df = pd.DataFrame({'id':id line, 'label':predicted_val}) 
:csv él JS EL ll LL D est 


submission_df.to_csv("submission.csv", index=False) 
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Twitter Sentiment Analysis 95 LLc cluoll Jui (19 

Dex dusler JS oi at as, As les 42 Twitter Le ;eLiall Lis 
LCR Œ nl LCA Er25 ol sl C6 ol L Lis Ji Q Lou Y «yeNl 
ges pl le ef leu 605 LI Les 

Natural Language pliut 355 Le pli Ji pile ul oi (5 
Det pe nitk à el Processing 


Jüoi Qc yclitoJl Jui 
tele} sf ab Lis 
import numpy as np # linear algebra 
import pandas as pd # data processing, CSV file 1/0 (e.g. pd.read_csv) 


from sklearn.model_selection import train test _ split # function for 
splitting data to train and test sets 


import nltk 
from nltk.corpus import stopwords 


from nltk.classify import SklearnClassifier 


from wordcloud import WordCloud, STOPWORDS 


import matplotlib.pyplot as plt 


DU] äcgono JjÙ 


data = pd.read_csv('Sentiment.csv') 
#Keeping only the neccessary columns 
data = data[['text','sentiment']] 


A DEN de game ls Lou dosage (I DÉLAI sos nds cod 65 ts 
LONG ULI de seges ce 10 


De Je 3e ie DS dll oi il ebauf bu it lg Lt 
D Es Lila cle cl 
#Splitting the dataset into train and test set 
train, test = train_test_split(data,test_size = Q.1) 
#Removing neutral sentiments 


train=train{train.sentiment!= "Neutral"] 
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pe Ji qe ail Le some) LS Le ol al ad ei JE RS 
À gg Lol LISI 


Ale ON osent ds 5 tés canelle de all Ladies ces EU du 
Lil LV ol Et AS SN SLI Li 2,4  WordCloud pu; 


train_pos = train[ train['sentiment'] == "Positive'] 
train_pos = train_pos['text'] 
train_neg = train! train['sentiment'] == 'Negative'] 


train_neg = train_negl'text'] 


def wordcloud_draw(data, color = ‘black'): 


words = ".join(data) 


cleaned_word = .join([word for word in words.split() 
if 'http' not in word 
and not word.startswith('a') 
and not word.startswith('#") 
and word != "RT' 
D) 
wordcloud = WordCloud(stopwords=STOPWORDS, 
background_color=color, 
width=2500, 
height=2000 
).generate(cleaned_ word) 
plt.figure(1,figsize=(13, 13)) 
plt.imshow(wordcloud) 
plt.axis('off') 


plt.show() 


print("Positive words") 
wordcloud_draw(train_pos,'white') 


print("Negative words") 
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wordcloud_draw(train_neg) 


dent will 


rid Bush 


raising ratings 


2 reat: -a MD": next 
Cruz. Trump do nes 
Enjoyed looking 


U Ce 11 always 
“ Hcandidate":" 
€ Ask Trump 

< © )X ui N 

QU Dons Le “LO SENS 

pieds together 
Fe Oquestion Look Ben çar r+ on 

4 = 

U 

ON 

O 

1 


Wallace enr 


Eair MP mp Balanced 


debate::: 


time 


ush ireminds 


Last. night 


Jo Le ycliol Jui 122 


truth le) SSI de sage GR I Las SSII Be plaadU véall 5e 


job «love «together «legitimate «strong 


£rûs truthful Ge Jill semis ob eV Ji UN Le usé à 
ils Al @ 3 legitimate 

news «influence :fe ES de ALU Cl xIl Goes eau Ci 5 
trying «cherry picking «makeup :softball disappointing «elevatormusic 


.cherry picking ; SN 5 sèle 
stopwords Lil Se DL ei evizualization SLI Lu, LI bozell de 
ŒUdul less) Loge Lanl le ses Y Cal SSI a : Stop Words 

El 
a gba cge SE 55 2 5 EN ons laut je GlaSl os Lies 2 Le Éole 
Cal this «for the) .&,, 2 

tueets =[] 


stopwords_set = set(stopwords.words("english")) 


for index, row in train.iterrows:() 


words_filtered = [e.lower() for e in row.text.split() if len(e) >= 
3] 


words_cleaned = [word for word in words _filtered 
if ‘'http' not in word 
and not word.startswith('a') 
and not word.startswith('#") 
and word != 'RT'] 


words_without_stopwords = [word for word in words cleaned if not 
word in stopwords_ set] 


tweets.append((words_ without_stopwords, row.sentiment)) 


test_pos = test[ test['sentiment'] == "Positive'] 


9 bU la Au 810 ci rlcil 8 lcgyiro 20 


test_pos = test_pos['text'] 


test_neg = test[ test['sentiment'] == 'Negative'] 


test_neg = test_negl'text'] 


ot Gb 5e Soi enltk lib plat slot elial plie cui It 8 as 
LU ms Lits à coll 25 il 
#Extracting word features 
def get_words_in_tweets(tweets): 
alt[] = 
for (words, sentiment) in tweets: 
all.extend(words) 


return all 


def get_word_features(wordlist): 
wordlist = nltk.FreqgDist(wordlist) 
features = wordlist.keys() 
return features 


w_features = get word features(get_words_ in _tweets(tweets)) 


def extract_features(document ): 
document_words = set(document) 
features{} = 
for word in w_features: 
features['contains(%s)' % word] = (word in document words) 
return features 
8 La QU Jon las plus 595 SLA IS SSI pus ci ia cr ses 
.debate nights 


wordcloud_draw(w_features) 
sua CUS we cases wi .nltk NaiveBayes Classifier plu 
À mall 


#Training the Naive Bayes classifier 
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training set = nltk.classify.apply_features(extract_features,tweets) 


classifier = nltk.NaiveBayesClassifier.train(training set) 


iraall Le; le Jees LES ls Je ASS Le aolée pleut dl 


neg_cnt 0 


0 


pos_cnt 
for obj in test_neg : 
res = classifier.classify(extract_features(obj.split())) 
if(res == 'Negative') : 
neg_cnt = neg_cnt + 1 
for obj in test_pos : 
res = classifier.classify(extract_features(obj.split())) 
if(res == ‘Positive') : 


pos_cnt = pos_cnt + 1 


print('[Negativel: %s/%s ‘" % (len(test_neg),neg_cnt)) 


print('[Positivel: %s/%s ‘" % (len(test_pos),pos_cnt)) 


[Negative]: 842/795 


[Positive]: 220/74 


Next Word Prediction Model äylil äo14Jl GLUUI & 19-05 (20 
google pass JU LIQL get lie LS Cl sel Gas SJ dire puis 
SUUI 535 Ci eu EL Lui bel Jane JL Steel LUN LS 25 5 Lai 
de JS QUI LIQL BU LUI Lg pilèoll 2e Ab s Glue Alaall 
Det pleut QU LUS GE Geull Abel piges cute pile lai six 

HU LISE LUI 754) 5 GKerass Tensorflow Se pit 


ele LB (RNN) 5, huge 8 Cons pile GILI LUI 55 pis pue) 
«C3 5 (6 ele) 5 55 DVI Logall 
&yUUI à oLAJU LUI eA90j 
:logoll sig) Lez A SSII Lèx 3) gui Les JUN LIL EU T5 sais AN 
import numpy as np 
from nltk.tokenize import RegexpTokenizer 
from keras.models import Sequential, load_model 
from keras.layers import LSTM 
from keras.layers.core import Dense, Activation 
from keras.optimizers import RMSprop 
import matplotlib.pyplot as plt 
import pickle 
import heapq 
AN LIL ES Joal Le ll miel Ju Google pis le ii LS 
pliée LUI LIL GT Le & oil alé les ace s LS Cl sil on us 
La ge DUO Re some 55 Ses bb Le game Lai prsule SU SUN ax 
ras (axe Le le due 55 As SUQI Jess Les off 
path = ‘1661-0.txt' 
text = open(path).read().lower() 


print('corpus length:', len(text)) 


corpus length: 581887 


SN ère 3975 059 Os LL LS JS I SU de same poudts ON pile 
Lil 


tokenizer = RegexpTokenizer(r'w+') 
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words = tokenizer.tokenize(text) 


riett ! ! 


['project', ‘gutenberg', 's', ‘the’, 'adventures', ‘of, 'sherlock', 'holmes', 'by", ,'our", 
‘email’, newsletter", ‘to', ‘hear’, ‘about, new’, 'ebooks'] 
lg EL Gfeature engineering lil Loue Les a QUI &leal à Se NI 
clies in il Sal Less Eu SLI GUIS JS Le ss LEE he «xl 
value LS dogs ele A5 «key 
unique _words = np.unique(words) 
unique _word_index = dict((c, i) for i, c in enumerate(unique words)) 
Cl jLoJI äauaia 
Leslie LS pli) I Lele 5285 5 LUSe 32e Lou a glas (1 A opel Lune a 
ehnell Rata ce Sos Lada Gubs d5 CiS TB) Lu LOL ele 43 sève el 
La ue ae LS é 


sde AU LolS se a LU LS se ace (SU LS Usb La sat 
LG lg lat DEN CLS LUI most LS Le Eli LUI LelSUl 
JU LUS 


WORD_LENGTH = 5 


prev_words[] 

next_words[] = 

for i in range(len(words) - WORD_ LENGTH): 
prev_words.append(words[i:i + WORD_LENGTH]) 
next_words.append(words[i + WORD_LENGTH]) 

print(prev_words[@]) 

print(next_words[@]) 


RARE 


['project’, 'gutenberg;, 's', ‘the’, 'adventures'] É 
Lg all Lol 52 y olll 52 x nb 0 one sde eLéoL ps5f se ON 
1 dl pe sal pres one dort LUI ST) vs x 3,5 


X =  np.zeros((len(prev_ words), WORD_LENGTH, len(unique_words)), 
dtype=boo1) 


Y = np.zeros((len(next_words), len(unique words)), dtype=bool) 
for i, each_ words in enumerate(prev_words): 
for j, each _ word in enumerate(each words): 


X[i, j, unique word_index[each_ word]] = 1 
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Y[i, unique word_index[next_words[i]]] = 1 
HIS 4e es due Je 815 GAS éooll 15 ON 
print(x[01[01) 
[False False False … False False False] 
ôJJ4ioJl äuacl äà4uiul Li 
La LUN RQ BU pige 8, Sell Lau RU past le oo S5 LS 
Que 65 RNN 323 LSTM pi gai paul 
model = Sequential() 
model.add(LSTM(128, input_shape=(WORD_ LENGTH, len(unique words)))) 
model.add(Dense(len(unique_ words))) 
model.add(Activation('softmax')) 
YUUI &o14Jl 4 9j Ego Lu Jai 
La 20 GRNN LUS GE m3 505 uns pl 
optimizer = RMSprop(lr=0.01) 


model.compile(loss='categorical_crossentropy", optimizer=optimizer, 
metrics=['accuracy']l) 


history = model.fit(X, Y, validation split=0.05, batch size=128, 
epochs=2, shuffle=True).history 


JAN os Sue dl s05 one US Call [5 cgrlans Led ei Luis DVI Les A5) 
JÆmll flat Y 75 sel ha Li 

model.save('keras_next_word_model.h5"') 

pickle.dump(history, open("history.p", "wb")) 

model = load_ model('keras_next_word_model.h5') 


history = pickle.load(open("history.p", "rb")) 


&yJUUI ol AU GUUUI € 39.05 ui 
LS Les 255 ol Le Éy Lerdsei 5 LAS le dupe 85 A0 OÙ Les 
es JA 
plt.plot(historyl'acc']) 
plt.plot(historyl'val_acc']) 
plt.title('model accuracy') 


plt.ylabel('accuracy') 
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plt.xlabel('epoch') 


plt.legend([l'train', 'test'], loc='upper left') 


model accuracy 
0.600 


——— train 
0.575 _— test 


0.550 


Oo 
ao 
N 
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accuracy 
[=] 
[ea] 
[=] 
[=] 


0475 
0.450 
0.425 


0.0 25 5.0 7.5 10.0 12.5 15.0 17.5 
epoch 


plt.plot(history['loss']) 
plt.plot(historyl'val_loss']) 
plt.title('model Lloss') 
plt.ylabel('loss') 
plt.xlabel('epoch') 


plt.legend(['train', ‘'test'], loc='upper left') 
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def prepare_input(text ): 


x = np.zeros((1, SEQUENCE LENGTH, len(chars))) 
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for t, char in enumerate(text ): 


x[0, t, char_indices[char]] = 1. 


return x 
-JU-5Yelbel eL5 lowerO Üls plasel a JG YU 255 Les 5 all L5 OI 
prepare_input("This is an example of input for our LSTM".lower()) 


array([[[ @., @., @., ..., 0., @., @.], 


5) 
[ 8., @., @., ..., 0., @., @.], 
[ 0, 10.10, :.., 187, 0, 0.1 
[M0 0.000 7... 01, 0.101: 
[20/0 00. 0 010.1, 
[ @., @., ., @., @., 0.]]]) 


Chiens + 
ads ce Sas Ge (CUS ds) Ge 40 5e 035$ Of cons culs of BY 
sl WII CSI Le Gas 5 gadl JS (1640457) JA 5 5e QÙ se 

ee JS Je Lasli] 
def prepare_input(text ): 
x = np.zeros((1, WORD LENGTH, len(unique words))) 
for t, word in enumerate(text.split()): 
print(word) 
x[@, t, unique word_index[word]] = 1 
return x 
prepare_input("It is not a Lack".lower()) 
array([[[ @., @., @., ..., 0., 0., 0.], 
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[NO 0 10, 2. Pr, 10100: 
[ 8., @., @., ..., @., @., 
[ 10: 0, 10. 2502, 02, (0! 


© 
ts 


sol pl Y ls LL es NI 
def sample(preds, top_n=3): 
preds = np.asarray(preds).astype('float64') 
preds = np.log(preds) 
exp_preds = np.exp(preds) 


preds = exp_preds / np.sum(exp_preds) 


ayUUI &oJAJU ELU 8 igoi 130 


return heapq.nlargest(top_n, range(len(preds)), preds.take) 
JEU LG EU Us LL 8 5Le DVI 
def predict_completion(text ): 
original_text = text 
generated = text 
completion'' = 
while True: 
x = prepare_input(text) 
preds = model.predict(x, verbose=0)[0] 
next_index = sample(preds, top _n=1)[0] 
next_char = indices _char[next_indexl] 
text = text[1:] + next_char 


completion += next_char 


if Len(original_ text + completion) + 2 > Len(original_text) and 


next_char:' ‘"— 
return completion 
LS be ébs rés Lu sl] > JU LIL Gel HUIT oja LS 5 
Jens pile DT otee ss D obul RNN 75505 clans (GiUls «ls 
Bodce > gel odtel 3,,55ll Wu 

def predict_completions(text, n=3): 

x = prepare_input(text) 

preds = model.predict(x, verbose=0)[0] 

next_indices = sample(preds, n) 


return [indices _char[idx] + predict_completion(text[1:] + 
indices_char[idx]) for idx in next_indices] 


LL 9 LS alascul LS GUls G > 40 55e doll Jubersl pusrute OS 


quotes =[ 


It is not a lack of Love, but a lack of friendship that makes 
unhappy marriages,". 


That which does not kill us makes us stronger,". 
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l'm not upset that you lied to me, l'm upset that from now on I 
can't believe you,". 

ù And those who were seen dancing were thought to be insane by those 
who could not hear the music,". 

d It is hard enough to remember my opinions, without also remembering 
my reasons for them"! 


[ 
LIN LISE RU 25 sel pleut ES et NI 
for q in quotes: 
seq = q[:40].1ower() 
print(seq) 
print(predict_completions(seq, 5)) 
print() 


it is not a lack of love, but a lack of 

['the ‘, ‘an ‘, ‘such ‘, ‘man ‘, ‘present, ‘] 

that which does not kill us makes us str 

['ength ‘, ‘uggle ‘, ‘ong ‘, ‘ange ‘, ‘ive '] 

i'm not upset that you lied to me, i'mu 

['nder ‘, ‘pon “Ses, ti, CUder cn] 

and those who were seen dancing were tho 

['se ‘, ‘re ‘, ‘ugh ‘, " servated ‘, ‘t ‘]Jit is hard enough to remember my 
opinion 
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